Antisense modulation of human FXR expression

ABSTRACT

Antisense compounds, compositions and methods are provided for modulating the expression of human FXR. The compositions comprise antisense compounds, particularly antisense oligonucleotides, targeted to nucleic acids encoding human FXR. Methods of using these compounds for modulation of human FXR expression and for treatment of diseases associated with expression of human FXR are provided.

FIELD OF THE INVENTION

[0001] The present invention provides compositions and methods for modulating the expression of human FXR. In particular, this invention relates to compounds, particularly oligonucleotides, specifically hybridizable with nucleic acids encoding human FXR. Such compounds have been shown to modulate the expression of human FXR.

BACKGROUND OF THE INVENTION

[0002] Steroid, thyroid and retinoid hormones produce a diverse array of physiologic effects through the regulation of gene expression. Upon entering the cell, these hormones bind to a unique group of intracellular nuclear receptors which have been characterized as ligand-dependent transcription factors. This complex then moves into the nucleus where the receptor and its cognate ligand interact with the transcription preinitiation complex affecting its stability and ultimately, the rate of transcription of the target genes. Members of the nuclear receptor family share several structural features including a central, highly conserved DNA-binding domain which targets the receptor to specific DNA sequences known as hormone response elements (Kliewer et al., Science, 1999, 284, 757-760).

[0003] The farnesoid X receptor (FXR, also known as NR1H4, retinoid X receptor-interacting protein 14; RIP14 and bile acid receptor; BAR) is a nuclear receptor that is activated by supra-physiological levels of farnesol (Forman et al., Cell, 1995, 81, 687-693).

[0004] Disclosed and claimed in U.S. Pat. No. 6,005,086 is the nucleic acid sequence coding for a mammalian FXR protein (Evans et al., 1999).

[0005] Bile acids are complex physiological molecules that are essential for solubilization, absorption and transport of dietary lipids in the intestine (Tu et al., Trends Cardiovasc. Med., 2000, 10, 30-35). Among the genes regulated by bile acids are cholesterol 7-alpha-hydroxylase (CYP7a), the rate limiting enzyme in the breakdown of cholesterol to bile acids and the intestinal bile acid-binding protein (I-BABP) which serves as a component of the bile transport system (Parks et al., Science, 1999, 284, 1365-1368).

[0006] When activated via ligand binding, FXR is known to repress synthesis of CYP7a and activate synthesis of I-BABP (Walters, Gut, 2000, 46, 308-309). The most active specific bile acid ligand for FXR is chenodeoxycholic acid (CDCA), a known regulator of several genes that participate in cholesterol/bile acid homeostasis including CYP7a and I-BABP (Kliewer et al., Science, 1999, 284, 757-760).

[0007] Hypercholesterolemia, one of the major contributing factors to atherosclerosis, is associated with high levels of low-density lipoprotein (LDL) cholesterol, an indicator of aberrant cholesterol homeostasis.

[0008] Since bile acids are involved in several aspects of cholesterol and lipid homeostasis, there is a need for new strategies targeting bile acid metabolic pathways in treatment of cardiovascular disease. Investigative strategies aimed at modulating FXR function have been so far limited to the use of small molecule inhibitors (Maloney et al., J. Med. Chem., 2000, 43, 2971-2974) and are as yet untested as therapeutic protocols. Consequently there remains a long felt need for agents capable of effectively modulating FXR function.

[0009] Antisense technology is emerging as an effective means for reducing the expression of specific gene products and may therefore prove to be uniquely useful in a number of therapeutic, diagnostic, and research applications for the modulation of FXR expression.

[0010] The present invention provides compositions and methods for modulating human FXR expression.

SUMMARY OF THE INVENTION

[0011] The present invention is directed to compounds, particularly antisense oligonucleotides, which are targeted to a nucleic acid encoding FXR, and which modulate the expression of human FXR. Pharmaceutical and other compositions comprising the compounds of the invention are also provided. Further provided are methods of modulating the expression of human FXR in cells or tissues comprising contacting said cells or tissues with one or more of the antisense compounds or compositions of the invention. Further provided are methods of treating a human suspected of having or being prone to a disease or condition associated with expression of FXR by administering a therapeutically or prophylactically effective amount of one or more of the antisense compounds or compositions of the invention.

DETAILED DESCRIPTION OF THE INVENTION

[0012] The present invention employs oligomeric compounds, particularly antisense oligonucleotides, for use in modulating the function of nucleic acid molecules encoding human FXR, ultimately modulating the amount of human FXR produced. This is accomplished by providing antisense compounds which specifically hybridize with one or more nucleic acids encoding human FXR. As used herein, the terms “target nucleic acid” and “nucleic acid encoding FXR” encompass DNA encoding FXR, RNA (including pre-mRNA and mRNA) transcribed from such DNA, and also cDNA derived from such RNA. The specific hybridization of an oligomeric compound with its target nucleic acid interferes with the normal function of the nucleic acid. This modulation of function of a target nucleic acid by compounds which specifically hybridize to it is generally referred to as “antisense”. The functions of DNA to be interfered with include replication and transcription. The functions of RNA to be interfered with include all vital functions such as, for example, translocation of the RNA to the site of protein translation, translation of protein from the RNA, splicing of the RNA to yield one or more mRNA species, and catalytic activity which may be engaged in or facilitated by the RNA. The overall effect of such interference with target nucleic acid function is modulation of the expression of FXR. In the context of the present invention, “modulation” means either an increase (stimulation) or a decrease (inhibition) in the expression of a gene. In the context of the present invention, inhibition is the preferred form of modulation of gene expression and mRNA is a preferred target.

[0013] It is preferred to target specific nucleic acids for antisense. “Targeting” an antisense compound to a particular nucleic acid, in the context of this invention, is a multistep process. The process usually begins with the identification of a nucleic acid sequence whose function is to be modulated. This may be, for example, a cellular gene (or mRNA transcribed from the gene) whose expression is associated with a particular disorder or disease state, or a nucleic acid molecule from an infectious agent. In the present invention, the target is a nucleic acid molecule encoding FXR. The targeting process also includes determination of a site or sites within this gene for the antisense interaction to occur such that the desired effect, e.g., detection or modulation of expression of the protein, will result. Within the context of the present invention, a preferred intragenic site is the region encompassing the translation initiation or termination codon of the open reading frame (ORF) of the gene. Since, as is known in the art, the translation initiation codon is typically 5′-AUG (in transcribed mRNA molecules; 5′-ATG in the corresponding DNA molecule), the translation initiation codon is also referred to as the “AUG codon,” the “start codon” or the “AUG start codon”. A minority of genes have a translation initiation codon having the RNA sequence 5′-GUG, 5′-UUG or 5′-CUG, and 5′-AUA, 5′-ACG and 5′-CUG have been shown to function in vivo. Thus, the terms “translation initiation codon” and “start codon” can encompass many codon sequences, even though the initiator amino acid in each instance is typically methionine (in eukaryotes) or formylmethionine (in prokaryotes). It is also known in the art that eukaryotic and prokaryotic genes may have two or more alternative start codons, any one of which may be preferentially utilized for translation initiation in a particular cell type or tissue, or under a particular set of conditions. In the context of the invention, “start codon” and “translation initiation codon” refer to the codon or codons that are used in vivo to initiate translation of an mRNA molecule transcribed from a gene encoding FXR, regardless of the sequence(s) of such codons.

[0014] It is also known in the art that a translation termination codon (or “stop codon”) of a gene may have one of three sequences, i.e., 5′-UAA, 5′-UAG and 5′-UGA (the corresponding DNA sequences are 5′-TAA, 5′-TAG and 5′-TGA, respectively). The terms “start codon region” and “translation initiation codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation initiation codon. Similarly, the terms “stop codon region” and “translation termination codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation termination codon.

[0015] The open reading frame (ORF) or “coding region,” which is known in the art to refer to the region between the translation initiation codon and the translation termination codon, is also a region which may be targeted effectively. Other target regions include the 5′ untranslated region (5′UTR), known in the art to refer to the portion of an mRNA in the 5′ direction from the translation initiation codon, and thus including nucleotides between the 5′ cap site and the translation initiation codon of an mRNA or corresponding nucleotides on the gene, and the 3′ untranslated region (3′UTR), known in the art to refer to the portion of an mRNA in the 3′ direction from the translation termination codon, and thus including nucleotides between the translation termination codon and 3′ end of an mRNA or corresponding nucleotides on the gene. The 5′ cap of an mRNA comprises an N7-methylated guanosine residue joined to the 5′-most residue of the mRNA via a 5′-5′ triphosphate linkage. The 5′ cap region of an mRNA is considered to include the 5′ cap structure itself as well as the first 50 nucleotides adjacent to the cap. The 5′ cap region may also be a preferred target region.

[0016] Although some eukaryotic mRNA transcripts are directly translated, many contain one or more regions, known as “introns,” which are excised from a transcript before it is translated. The remaining (and therefore translated) regions are known as “exons” and are spliced together to form a continuous mRNA sequence. mRNA splice sites, i.e., intron-exon junctions, may also be preferred target regions, and are particularly useful in situations where aberrant splicing is implicated in disease, or where an overproduction of a particular mRNA splice product is implicated in disease. Aberrant fusion junctions due to rearrangements or deletions are also preferred targets. It has also been found that introns can also be effective, and therefore preferred, target regions for antisense compounds targeted, for example, to DNA or pre-mRNA.

[0017] Once one or more target sites have been identified, oligonucleotides are chosen which are sufficiently complementary to the target, i.e., hybridize sufficiently well and with sufficient specificity, to give the desired effect.

[0018] In the context of this invention, “hybridization” means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds. “Complementary,” as used herein, refers to the capacity for precise pairing between two nucleotides. For example, if a nucleotide at a certain position of an oligonucleotide is capable of hydrogen bonding with a nucleotide at the same position of a DNA or RNA molecule, then the oligonucleotide and the DNA or RNA are considered to be complementary to each other at that position. The oligonucleotide and the DNA or RNA are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides which can hydrogen bond with each other. Thus, “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the oligonucleotide and the DNA or RNA target. It is understood in the art that the sequence of an antisense compound need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable. An antisense compound is specifically hybridizable when binding of the compound to the target DNA or RNA molecule interferes with the normal function of the target DNA or RNA to cause a loss of utility, and there is a sufficient degree of complementarity to avoid non-specific binding of the antisense compound to non-target sequences under conditions in which specific binding is desired, i.e., under physiological conditions in the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, under conditions in which the assays are performed.

[0019] Antisense and other compounds of the invention which hybridize to the target and inhibit expression of the target are identified through experimentation, and the sequences of these compounds are hereinbelow identified as preferred embodiments of the invention. The target sites to which these preferred sequences are complementary are hereinbelow referred to as “active sites” and are therefore preferred sites for targeting. Therefore another embodiment of the invention encompasses compounds which hybridize to these active sites.

[0020] Antisense compounds are commonly used as research reagents and diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene expression with exquisite specificity, are often used by those of ordinary skill to elucidate the function of particular genes. Antisense compounds are also used, for example, to distinguish between functions of various members of a biological pathway. Antisense modulation has, therefore, been harnessed for research use.

[0021] For use in kits and diagnostics, the antisense compounds of the present invention, either alone or in combination with other antisense compounds or therapeutics, can be used as tools in differential and/or combinatorial analyses to elucidate expression patterns of a portion or the entire complement of genes expressed within cells and tissues.

[0022] Expression patterns within cells or tissues treated with one or more antisense compounds are compared to control cells or tissues not treated with antisense compounds and the patterns produced are analyzed for differential levels of gene expression as they pertain, for example, to disease association, signaling pathway, cellular localization, expression level, size, structure or function of the genes examined. These analyses can be performed on stimulated or unstimulated cells and in the presence or absence of other compounds which affect expression patterns.

[0023] Examples of methods of gene expression analysis known in the art include DNA arrays or microarrays (Brazma and Vilo, FEBS Lett., 2000, 480, 17-24; Celis, et al., FEBS Lett., 2000, 480, 2-16), SAGE (serial analysis of gene expression)(Madden, et al., Drug Discov. Today, 2000, 5, 415-425), READS (restriction enzyme amplification of digested cDNAs) (Prashar and Weissman, Methods Enzymol., 1999, 303, 258-72), TOGA (total gene expression analysis) (Sutcliffe, et al., Proc. Natl. Acad. Sci. U.S.A., 2000, 97, 1976-81), protein arrays and proteomics (Celis, et al., FEBS Lett., 2000, 480, 2-16; Jungblut, et al., Electrophoresis, 1999, 20, 2100-10), expressed sequence tag (EST) sequencing (Celis, et al., FEBS Lett., 2000, 480, 2-16; Larsson, et al., J. Biotechnol., 2000, 80, 143-57), subtractive RNA fingerprinting (SuRF) (Fuchs, et al., Anal. Biochem., 2000, 286, 91-98; Larson, et al., Cytometry, 2000, 41, 203-208), subtractive cloning, differential display (DD) (Jurecic and Belmont, Curr. Opin. Microbiol., 2000, 3, 316-21), comparative genomic hybridization (Carulli, et al., J. Cell Biochem. Suppl., 1998, 31, 286-96), FISH (fluorescent in situ hybridization) techniques (Going and Gusterson, Eur. J. Cancer, 1999, 35, 1895-904) and mass spectrometry methods (reviewed in (To, Comb. Chem. High Throughput Screen, 2000, 3, 235-41).

[0024] The specificity and sensitivity of antisense is also harnessed by those of skill in the art for therapeutic uses. Antisense oligonucleotides have been employed as therapeutic moieties in the treatment of disease states in animals and man. Antisense oligonucleotide drugs, including ribozymes, have been safely and effectively administered to humans and numerous clinical trials are presently underway. It is thus established that oligonucleotides can be useful therapeutic modalities that can be configured to be useful in treatment regimes for treatment of cells, tissues and animals, especially humans.

[0025] In the context of this invention, the term “oligonucleotide” refers to an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or mimetics thereof. This term includes oligonucleotides composed of naturally-occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as well as oligonucleotides having non-naturally-occurring portions which function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases.

[0026] While antisense oligonucleotides are a preferred form of antisense compound, the present invention comprehends other oligomeric antisense compounds, including but not limited to oligonucleotide mimetics such as are described below. The antisense compounds in accordance with this invention preferably comprise from about 8 to about 50 nucleobases (i.e. from about 8 to about 50 linked nucleosides). Particularly preferred antisense compounds are antisense oligonucleotides, even more preferably those comprising from about 12 to about 30 nucleobases. Antisense compounds include ribozymes, external guide sequence (EGS) oligonucleotides (oligozymes), and other short catalytic RNAs or catalytic oligonucleotides which hybridize to the target nucleic acid and modulate its expression.

[0027] As is known in the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is normally a heterocyclic base. The two most common classes of such heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that further include a phosphate group covalently linked to the sugar portion of the nucleoside. For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be linked to either the 2′, 3′ or 5′ hydroxyl moiety of the sugar. In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one another to form a linear polymeric compound. In turn the respective ends of this linear polymeric structure can be further joined to form a circular structure, however, open linear structures are generally preferred. Within the oligonucleotide structure, the phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3′ to 5′ phosphodiester linkage.

[0028] Specific examples of preferred antisense compounds useful in this invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. As defined in this specification, oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides.

[0029] Preferred modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotri-esters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates, 5′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates and borano-phosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3′ to 3′, 5′ to 5′ or 2′ to 2′ linkage. Preferred oligonucleotides having inverted polarity comprise a single 3′ to 3′ linkage at the 3′-most internucleotide linkage i.e. a single inverted nucleoside residue which may be abasic (the nucleobase is missing or has a hydroxyl group in place thereof). Various salts, mixed salts and free acid forms are also included.

[0030] Representative United States patents that teach the preparation of the above phosphorus-containing linkages include, but are not limited to, U.S. Pat Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 5,194,599; 5,565,555; 5,527,899; 5,721,218; 5,672,697 and 5,625,050, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0031] Preferred modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂ component parts.

[0032] Representative United States patents that teach the preparation of the above oligonucleosides include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; 5,792,608; 5,646,269 and 5,677,439, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0033] In other preferred oligonucleotide mimetics, both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos.: 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al., Science, 1991, 254, 1497-1500.

[0034] Most preferred embodiments of the invention are oligonucleotides with phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in particular —CH₂—NH—O—CH₂—, —CH₂—N(CH₃)—O—CH₂— [known as a methylene (methylimino) or MMI backbone], —CH₂—O—N(CH₃)—CH₂—, —CH₂—N(CH₃)—N(CH₃)—CH₂— and —O—N(CH₃)—CH₂—CH₂— [wherein the native phosphodiester backbone is represented as —O—P—O—CH₂—] of the above referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also preferred are oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506.

[0035] Modified oligonucleotides may also contain one or more substituted sugar moieties. Preferred oligonucleotides comprise one of the following at the 2′ position: OH; F; O—, S—, or N-alkyl; O—, S—, or N-alkenyl; O—, S—or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyl and alkynyl. Particularly preferred are O[(CH₂)_(n)O]_(m)CH₃, O(CH₂)_(n)OCH₃, O(CH₂)_(n)NH₂, O(CH₂)_(n)CH₃, O(CH₂)_(n)ONH₂, and O(CH₂)_(n)ON[(CH₂)_(n)CH₃)]₂, where n and m are from 1 to about 10. Other preferred oligonucleotides comprise one of the following at the 2′ position: C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃, SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A preferred modification includes 2′-methoxyethoxy (2′-O—CH₂CH₂OCH₃, also known as 2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504) i.e., an alkoxyalkoxy group. A further preferred modification includes 2′-dimethylaminooxyethoxy, i.e., a O(CH₂)₂ON(CH₃)₂ group, also known as 2′-DMAOE, as described in examples hereinbelow, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, also described in examples hereinbelow.

[0036] A further prefered modification includes Locked Nucleic Acids (LNAs) in which the 2′-hydroxyl group is linked to the 3′ or 4′ carbon atom of the sugar ring thereby forming a bicyclic sugar moiety. The linkage is preferably a methelyne (—CH₂—)_(n) group bridging the 2′ oxygen atom and the 4′ carbon atom wherein n is 1 or 2. LNAs and preparation thereof are described in WO 98/39352 and WO 99/14226.

[0037] Other preferred modifications include 2′-methoxy (2′-O—CH₃), 2′-aminopropoxy (2′-OCH₂CH₂CH₂NH₂), 2′-allyl (2′-CH₂—CH═CH₂), 2′-O-allyl (2′-O—CH₂—CH═CH₂) and 2′-fluoro (2′-F). The 2′-modification may be in the arabino (up) position or ribo (down) position. A preferred 2′-arabino modification is 2′-F. Similar modifications may also be made at other positions on the oligonucleotide, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′ position of 5′ terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. Representative United States patents that teach the preparation of such modified sugar structures include, but are not limited to, U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; 5,792,747; and 5,700,920, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0038] Oligonucleotides may also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (—C≡C—CH₃) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic pyrimidines such as phenoxazine cytidine(1H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), phenothiazine cytidine (1H-pyrimido[5,4-b][1,4]benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine (e.g. 9-(2-aminoethoxy)-H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), carbazole cytidine (2H-pyrimido[4,5-b]indol-2-one), pyridoindole cytidine (H-pyrido[3′,2′:4,5]pyrrolo[2,3-d]pyrimidin-2-one). Modified nucleobases may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B., ed., CRC Press, 1993. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are presently preferred base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications.

[0039] Representative United States patents that teach the preparation of certain of the above noted modified nucleobases as well as other modified nucleobases include, but are not limited to, the above noted U.S. Pat. No. 3,687,808, as well as U.S. Pat. Nos. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 5,645,985; 5,830,653; 5,763,588; 6,005,096; and 5,681,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference, and U.S. Pat. No. 5,750,692, which is commonly owned with the instant application and also herein incorporated by reference.

[0040] Another modification of the oligonucleotides of the invention involves chemically linking to the oligonucleotide one or more moieties or conjugates which enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. The compounds of the invention can include conjugate groups covalently bound to functional groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers. Typical conjugates groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluores-ceins, rhodamines, coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, include groups that improve oligomer uptake, enhance oligomer resistance to degradation, and/or strengthen sequence-specific hybridization with RNA. Groups that enhance the pharmacokinetic properties, in the context of this invention, include groups that improve oligomer uptake, distribution, metabolism or excretion. Representative conjugate groups are disclosed in International Patent Application PCT/US92/09196, filed Oct. 23, 1992 the entire disclosure of which is incorporated herein by reference. Conjugate moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277, 923-937. Oligonucleotides of the invention may also be conjugated to active drug substances, for example, aspirin, warfarin, phenylbutazone, ibuprofen, suprofen, fenbufen, ketoprofen, (S)-(+)-pranoprofen, carprofen, dansylsarcosine, 2,3,5-triiodobenzoic acid, flufenamic acid, folinic acid, a benzothiadiazide, chlorothiazide, a diazepine, indomethicin, a barbiturate, a cephalosporin, a sulfa drug, an antidiabetic, an antibacterial or an antibiotic. Oligonucleotide-drug conjugates and their preparation are described in U.S. patent application Ser. No. 09/334,130 (filed Jun. 15, 1999) which is incorporated herein by reference in its entirety.

[0041] Representative United States patents that teach the preparation of such oligonucleotide conjugates include, but are not limited to, U.S. Pat. Nos. 4,828,979; 4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 5,688,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference.

[0042] It is not necessary for all positions in a given compound to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single compound or even at a single nucleoside within an oligonucleotide. The present invention also includes antisense compounds which are chimeric compounds. “Chimeric” antisense compounds or “chimeras,” in the context of this invention, are antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased binding affinity for the target nucleic acid. An additional region of the oligonucleotide may serve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, RNase H is a cellular endonuclease which cleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene expression. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target region. Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art.

[0043] Chimeric antisense compounds of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers. Representative United States patents that teach the preparation of such hybrid structures include, but are not limited to, U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0044] The antisense compounds used in accordance with this invention may be conveniently and routinely made through the well-known technique of solid phase synthesis. Equipment for such synthesis is sold by several vendors including, for example, Applied Biosystems (Foster City, Calif.). Any other means for such synthesis known in the art may additionally or alternatively be employed. It is well known to use similar techniques to prepare oligonucleotides such as the phosphorothioates and alkylated derivatives.

[0045] The antisense compounds of the invention are synthesized in vitro and do not include antisense compositions of biological origin, or genetic vector constructs designed to direct the in vivo synthesis of antisense molecules. The compounds of the invention may also be admixed, encapsulated, conjugated or otherwise associated with other molecules, molecule structures or mixtures of compounds, as for example, liposomes, receptor targeted molecules, oral, rectal, topical or other formulations, for assisting in uptake, distribution and/or absorption. Representative United States patents that teach the preparation of such uptake, distribution and/or absorption assisting formulations include, but are not limited to, U.S. Pat. Nos. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 5,521,291; 5,543,158; 5,547,932; 5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 5,213,804; 5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 5,469,854; 5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 5,595,756, each of which is herein incorporated by reference.

[0046] The antisense compounds of the invention encompass any pharmaceutically acceptable salts, esters, or salts of such esters, or any other compound which, upon administration to an animal including a human, is capable of providing (directly or indirectly) the biologically active metabolite or residue thereof. Accordingly, for example, the disclosure is also drawn to prodrugs and pharmaceutically acceptable salts of the compounds of the invention, pharmaceutically acceptable salts of such prodrugs, and other bioequivalents.

[0047] The term “prodrug” indicates a therapeutic agent that is prepared in an inactive form that is converted to an active form (i.e., drug) within the body or cells thereof by the action of endogenous enzymes or other chemicals and/or conditions. In particular, prodrug versions of the oligonucleotides of the invention are prepared as SATE [(S-acetyl-2-thioethyl) phosphate] derivatives according to the methods disclosed in WO 93/24510 to Gosselin et al., published Dec. 9, 1993 or in WO 94/26764 and U.S. Pat. No. 5,770,713 to Imbach et al.

[0048] The term “pharmaceutically acceptable salts” refers to physiologically and pharmaceutically acceptable salts of the compounds of the invention: i.e., salts that retain the desired biological activity of the parent compound and do not impart undesired toxicological effects thereto.

[0049] Pharmaceutically acceptable base addition salts are formed with metals or amines, such as alkali and alkaline earth metals or organic amines. Examples of metals used as cations are sodium, potassium, magnesium, calcium, and the like. Examples of suitable amines are N,N′-dibenzylethylenediamine, chloroprocaine, choline, diethanolamine, dicyclohexylamine, ethylenediamine, N-methylglucamine, and procaine (see, for example, Berge et al., “Pharmaceutical Salts,” J. of Pharma Sci., 1977, 66, 1-19). The base addition salts of said acidic compounds are prepared by contacting the free acid form with a sufficient amount of the desired base to produce the salt in the conventional manner. The free acid form may be regenerated by contacting the salt form with an acid and isolating the free acid in the conventional manner. The free acid forms differ from their respective salt forms somewhat in certain physical properties such as solubility in polar solvents, but otherwise the salts are equivalent to their respective free acid for purposes of the present invention. As used herein, a “pharmaceutical addition salt” includes a pharmaceutically acceptable salt of an acid form of one of the components of the compositions of the invention. These include organic or inorganic acid salts of the amines. Preferred acid salts are the hydrochlorides, acetates, salicylates, nitrates and phosphates. Other suitable pharmaceutically acceptable salts are well known to those skilled in the art and include basic salts of a variety of inorganic and organic acids, such as, for example, with inorganic acids, such as for example hydrochloric acid, hydrobromic acid, sulfuric acid or phosphoric acid; with organic carboxylic, sulfonic, sulfo or phospho acids or N-substituted sulfamic acids, for example acetic acid, propionic acid, glycolic acid, succinic acid, maleic acid, hydroxymaleic acid, methylmaleic acid, fumaric acid, malic acid, tartaric acid, lactic acid, oxalic acid, gluconic acid, glucaric acid, glucuronic acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, salicylic acid, 4-aminosalicylic acid, 2-phenoxybenzoic acid, 2-acetoxybenzoic acid, embonic acid, nicotinic acid or isonicotinic acid; and with amino acids, such as the 20 alpha-amino acids involved in the synthesis of proteins in nature, for example glutamic acid or aspartic acid, and also with phenylacetic acid, methanesulfonic acid, ethanesulfonic acid, 2-hydroxyethanesulfonic acid, ethane-1,2-disulfonic acid, benzenesulfonic acid, 4-methylbenzenesulfonic acid, naphthalene-2-sulfonic acid, naphthalene-1,5-disulfonic acid, 2- or 3-phosphoglycerate, glucose-6-phosphate, N-cyclohexylsulfamic acid (with the formation of cyclamates), or with other acid organic compounds, such as ascorbic acid. Pharmaceutically acceptable salts of compounds may also be prepared with a pharmaceutically acceptable cation. Suitable pharmaceutically acceptable cations are well known to those skilled in the art and include alkaline, alkaline earth, ammonium and quaternary ammonium cations. Carbonates or hydrogen carbonates are also possible.

[0050] For oligonucleotides, preferred examples of pharmaceutically acceptable salts include but are not limited to (a) salts formed with cations such as sodium, potassium, ammonium, magnesium, calcium, polyamines such as spermine and spermidine, etc.; (b) acid addition salts formed with inorganic acids, for example hydrochloric acid, hydrobromic acid, sulfuric acid, phosphoric acid, nitric acid and the like; (c) salts formed with organic acids such as, for example, acetic acid, oxalic acid, tartaric acid, succinic acid, maleic acid, fumaric acid, gluconic acid, citric acid, malic acid, ascorbic acid, benzoic acid, tannic acid, palmitic acid, alginic acid, polyglutamic acid, naphthalenesulfonic acid, methanesulfonic acid, p-toluenesulfonic acid, naphthalenedisulfonic acid, polygalacturonic acid, and the like; and (d) salts formed from elemental anions such as chlorine, bromine, and iodine.

[0051] The antisense compounds of the present invention can be utilized for diagnostics, therapeutics, prophylaxis and as research reagents and kits. For therapeutics, an animal, preferably a human, suspected of having a disease or disorder which can be treated by modulating the expression of FXR is treated by administering antisense compounds in accordance with this invention. The compounds of the invention can be utilized in pharmaceutical compositions by adding an effective amount of an antisense compound to a suitable pharmaceutically acceptable diluent or carrier. Use of the antisense compounds and methods of the invention may also be useful prophylactically, e.g., to prevent or delay infection, inflammation or tumor formation, for example.

[0052] The antisense compounds of the invention are useful for research and diagnostics, because these compounds hybridize to nucleic acids encoding FXR, enabling sandwich and other assays to easily be constructed to exploit this fact. Hybridization of the antisense oligonucleotides of the invention with a nucleic acid encoding FXR can be detected by means known in the art. Such means may include conjugation of an enzyme to the oligonucleotide, radiolabelling of the oligonucleotide or any other suitable detection means. Kits using such detection means for detecting the level of FXR in a sample may also be prepared.

[0053] The present invention also includes pharmaceutical compositions and formulations which include the antisense compounds of the invention. The pharmaceutical compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic and to mucous membranes including vaginal and rectal delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration. Oligonucleotides with at least one 2′-O-methoxyethyl modification are believed to be particularly useful for oral administration.

[0054] Pharmaceutical compositions and formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. Coated condoms, gloves and the like may also be useful. Preferred topical formulations include those in which the oligonucleotides of the invention are in admixture with a topical delivery agent such as lipids, liposomes, fatty acids, fatty acid esters, steroids, chelating agents and surfactants. Preferred lipids and liposomes include neutral (e.g. dioleoylphosphatidyl DOPE ethanolamine, dimyristoylphosphatidyl choline DMPC, distearolyphosphatidyl choline) negative (e.g. dimyristoylphosphatidyl glycerol DMPG) and cationic (e.g. dioleoyltetramethylaminopropyl DOTAP and dioleoylphosphatidyl ethanolamine DOTMA). Oligonucleotides of the invention may be encapsulated within liposomes or may form complexes thereto, in particular to cationic liposomes. Alternatively, oligonucleotides may be complexed to lipids, in particular to cationic lipids. Preferred fatty acids and esters include but are not limited arachidonic acid, oleic acid, eicosanoic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a C₁₋₁₀ alkyl ester (e.g. isopropylmyristate IPM), monoglyceride, diglyceride or pharmaceutically acceptable salt thereof. Topical formulations are described in detail in U.S. patent application Ser. No. 09/315,298 filed on May 20, 1999 which is incorporated herein by reference in its entirety.

[0055] Compositions and formulations for oral administration include powders or granules, microparticulates, nanoparticulates, suspensions or solutions in water or non-aqueous media, capsules, gel capsules, sachets, tablets or minitablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders may be desirable. Preferred oral formulations are those in which oligonucleotides of the invention are administered in conjunction with one or more penetration enhancers surfactants and chelators. Preferred surfactants include fatty acids and/or esters or salts thereof, bile acids and/or salts thereof. Prefered bile acids/salts include chenodeoxycholic acid (CDCA) and ursodeoxychenodeoxycholic acid (UDCA), cholic acid, dehydrocholic acid, deoxycholic acid, glucholic acid, glycholic acid, glycodeoxycholic acid, taurocholic acid, taurodeoxycholic acid, sodium tauro-24,25-dihydro-fusidate, sodium glycodihydrofusidate,. Prefered fatty acids include arachidonic acid, undecanoic acid, oleic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a monoglyceride, a diglyceride or a pharmaceutically acceptable salt thereof (e.g. sodium). Also prefered are combinations of penetration enhancers, for example, fatty acids/salts in combination with bile acids/salts. A particularly prefered combination is the sodium salt of lauric acid, capric acid and UDCA. Further penetration enhancers include polyoxyethylene-9-lauryl ether, polyoxyethylene-20-cetyl ether. Oligonucleotides of the invention may be delivered orally in granular form including sprayed dried particles, or complexed to form micro or nanoparticles. Oligonucleotide complexing agents include poly-amino acids; polyimines; polyacrylates; polyalkylacrylates, polyoxethanes, polyalkylcyanoacrylates; cationized gelatins, albumins, starches, acrylates, polyethyleneglycols (PEG) and starches; polyalkylcyanoacrylates; DEAE-derivatized polyimines, pollulans, celluloses and starches. Particularly preferred complexing agents include chitosan, N-trimethylchitosan, poly-L-lysine, polyhistidine, polyornithine, polyspermines, protamine, polyvinylpyridine, polythiodiethylamino-methylethylene P(TDAE), polyaminostyrene (e.g. p-amino), poly(methylcyanoacrylate), poly(ethylcyanoacrylate), poly(butylcyanoacrylate), poly(isobutylcyanoacrylate), poly(isohexylcynaoacrylate), DEAE-methacrylate, DEAE-hexylacrylate, DEAE-acrylamide, DEAE-albumin and DEAE-dextran, polymethylacrylate, polyhexylacrylate, poly(D,L-lactic acid), poly(DL-lactic-co-glycolic acid (PLGA), alginate, and polyethyleneglycol (PEG). Oral formulations for oligonucleotides and their preparation are described in detail in U.S. application Ser. Nos. 08/886,829 (filed Jul. 1, 1997), 09/108,673 (filed Jul. 1, 1998), 09/256,515 (filed Feb. 23, 1999), 09/082,624 (filed May 21, 1998) and 09/315,298 (filed May 20, 1999) each of which is incorporated herein by reference in their entirety.

[0056] Compositions and formulations for parenteral, intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.

[0057] Pharmaceutical compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions may be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.

[0058] The pharmaceutical formulations of the present invention, which may conveniently be presented in unit dosage form, may be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include the step of bringing into association the active ingredients with the pharmaceutical carrier(s) or excipient(s). In general the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.

[0059] The compositions of the present invention may be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, gel capsules, liquid syrups, soft gels, suppositories, and enemas. The compositions of the present invention may also be formulated as suspensions in aqueous, non-aqueous or mixed media. Aqueous suspensions may further contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0060] In one embodiment of the present invention the pharmaceutical compositions may be formulated and used as foams. Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product. The preparation of such compositions and formulations is generally known to those skilled in the pharmaceutical and formulation arts and may be applied to the formulation of the compositions of the present invention.

[0061] Emulsions

[0062] The compositions of the present invention may be prepared and formulated as emulsions. Emulsions are typically heterogenous systems of one liquid dispersed in another in the form of droplets usually exceeding 0.1 μm in diameter. (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199; Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., Volume 1, p. 245; Block in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 2, p. 335; Higuchi et al., in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 301). Emulsions are often biphasic systems comprising of two immiscible liquid phases intimately mixed and dispersed with each other. In general, emulsions may be either water-in-oil (w/o) or of the oil-in-water (o/w) variety. When an aqueous phase is finely divided into and dispersed as minute droplets into a bulk oily phase the resulting composition is called a water-in-oil (w/o) emulsion. Alternatively, when an oily phase is finely divided into and dispersed as minute droplets into a bulk aqueous phase the resulting composition is called an oil-in-water (o/w) emulsion. Emulsions may contain additional components in addition to the dispersed phases and the active drug which may be present as a solution in either the aqueous phase, oily phase or itself as a separate phase. Pharmaceutical excipients such as emulsifiers, stabilizers, dyes, and anti-oxidants may also be present in emulsions as needed. Pharmaceutical emulsions may also be multiple emulsions that are comprised of more than two phases such as, for example, in the case of oil-in-water-in-oil (o/w/o) and water-in-oil-in-water (w/o/w) emulsions. Such complex formulations often provide certain advantages that simple binary emulsions do not. Multiple emulsions in which individual oil droplets of an o/w emulsion enclose small water droplets constitute a w/o/w emulsion. Likewise a system of oil droplets enclosed in globules of water stabilized in an oily continuous provides an o/w/o emulsion.

[0063] Emulsions are characterized by little or no thermodynamic stability. Often, the dispersed or discontinuous phase of the emulsion is well dispersed into the external or continuous phase and maintained in this form through the means of emulsifiers or the viscosity of the formulation. Either of the phases of the emulsion may be a semisolid or a solid, as is the case of emulsion-style ointment bases and creams. Other means of stabilizing emulsions entail the use of emulsifiers that may be incorporated into either phase of the emulsion. Emulsifiers may broadly be classified into four categories: synthetic surfactants, naturally occurring emulsifiers, absorption bases, and finely dispersed solids (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0064] Synthetic surfactants, also known as surface active agents, have found wide applicability in the formulation of emulsions and have been reviewed in the literature (Rieger, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), Marcel Dekker, Inc., New York, N.Y., 1988, volume 1, p. 199). Surfactants are typically amphiphilic and comprise a hydrophilic and a hydrophobic portion. The ratio of the hydrophilic to the hydrophobic nature of the surfactant has been termed the hydrophile/lipophile balance (HLB) and is a valuable tool in categorizing and selecting surfactants in the preparation of formulations. Surfactants may be classified into different classes based on the nature of the hydrophilic group: nonionic, anionic, cationic and amphoteric (Rieger, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285).

[0065] Naturally occurring emulsifiers used in emulsion formulations include lanolin, beeswax, phosphatides, lecithin and acacia. Absorption bases possess hydrophilic properties such that they can soak up water to form w/o emulsions yet retain their semisolid consistencies, such as anhydrous lanolin and hydrophilic petrolatum. Finely divided solids have also been used as good emulsifiers especially in combination with surfactants and in viscous preparations. These include polar inorganic solids, such as heavy metal hydroxides, nonswelling clays such as bentonite, attapulgite, hectorite, kaolin, montmorillonite, colloidal aluminum silicate and colloidal magnesium aluminum silicate, pigments and nonpolar solids such as carbon or glyceryl tristearate.

[0066] A large variety of non-emulsifying materials are also included in emulsion formulations and contribute to the properties of emulsions. These include fats, oils, waxes, fatty acids, fatty alcohols, fatty esters, humectants, hydrophilic colloids, preservatives and antioxidants (Block, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0067] Hydrophilic colloids or hydrocolloids include naturally occurring gums and synthetic polymers such as polysaccharides (for example, acacia, agar, alginic acid, carrageenan, guar gum, karaya gum, and tragacanth), cellulose derivatives (for example, carboxymethylcellulose and carboxypropylcellulose), and synthetic polymers (for example, carbomers, cellulose ethers, and carboxyvinyl polymers). These disperse or swell in water to form colloidal solutions that stabilize emulsions by forming strong interfacial films around the dispersed-phase droplets and by increasing the viscosity of the external phase.

[0068] Since emulsions often contain a number of ingredients such as carbohydrates, proteins, sterols and phosphatides that may readily support the growth of microbes, these formulations often incorporate preservatives. Commonly used preservatives included in emulsion formulations include methyl paraben, propyl paraben, quaternary ammonium salts, benzalkonium chloride, esters of p-hydroxybenzoic acid, and boric acid. Antioxidants are also commonly added to emulsion formulations to prevent deterioration of the formulation. Antioxidants used may be free radical scavengers such as tocopherols, alkyl gallates, butylated hydroxyanisole, butylated hydroxytoluene, or reducing agents such as ascorbic acid and sodium metabisulfite, and antioxidant synergists such as citric acid, tartaric acid, and lecithin.

[0069] The application of emulsion formulations via dermatological, oral and parenteral routes and methods for their manufacture have been reviewed in the literature (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Emulsion formulations for oral delivery have been very widely used because of reasons of ease of formulation, efficacy from an absorption and bioavailability standpoint. (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Mineral-oil base laxatives, oil-soluble vitamins and high fat nutritive preparations are among the materials that have commonly been administered orally as o/w emulsions.

[0070] In one embodiment of the present invention, the compositions of oligonucleotides and nucleic acids are formulated as microemulsions. A microemulsion may be defined as a system of water, oil and amphiphile which is a single optically isotropic and thermodynamically stable liquid solution (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Typically microemulsions are systems that are prepared by first dispersing an oil in an aqueous surfactant solution and then adding a sufficient amount of a fourth component, generally an intermediate chain-length alcohol to form a transparent system. Therefore, microemulsions have also been described as thermodynamically stable, isotropically clear dispersions of two immiscible liquids that are stabilized by interfacial films of surface-active molecules (Leung and Shah, in: Controlled Release of Drugs: Polymers and Aggregate Systems, Rosoff, M., Ed., 1989, VCH Publishers, New York, pages 185-215). Microemulsions commonly are prepared via a combination of three to five components that include oil, water, surfactant, cosurfactant and electrolyte. Whether the microemulsion is of the water-in-oil (w/o) or an oil-in-water (o/w) type is dependent on the properties of the oil and surfactant used and on the structure and geometric packing of the polar heads and hydrocarbon tails of the surfactant molecules (Schott, in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 271).

[0071] The phenomenological approach utilizing phase diagrams has been extensively studied and has yielded a comprehensive knowledge, to one skilled in the art, of how to formulate microemulsions (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Block, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335). Compared to conventional emulsions, microemulsions offer the advantage of solubilizing water-insoluble drugs in a formulation of thermodynamically stable droplets that are formed spontaneously.

[0072] Surfactants used in the preparation of microemulsions include, but are not limited to, ionic surfactants, non-ionic surfactants, Brij 96, polyoxyethylene oleyl ethers, polyglycerol fatty acid esters, tetraglycerol monolaurate (ML310), tetraglycerol monooleate (MO310), hexaglycerol monooleate (PO310), hexaglycerol pentaoleate (PO500), decaglycerol monocaprate (MCA750), decaglycerol monooleate (MO750), decaglycerol sequioleate (SO750), decaglycerol decaoleate (DA0750), alone or in combination with cosurfactants. The cosurfactant, usually a short-chain alcohol such as ethanol, 1-propanol, and 1-butanol, serves to increase the interfacial fluidity by penetrating into the surfactant film and consequently creating a disordered film because of the void space generated among surfactant molecules. Microemulsions may, however, be prepared without the use of cosurfactants and alcohol-free self-emulsifying microemulsion systems are known in the art. The aqueous phase may typically be, but is not limited to, water, an aqueous solution of the drug, glycerol, PEG300, PEG400, polyglycerols, propylene glycols, and derivatives of ethylene glycol. The oil phase may include, but is not limited to, materials such as Captex 300, Captex 355, Capmul MCM, fatty acid esters, medium chain (C8-C12) mono, di, and tri-glycerides, polyoxyethylated glyceryl fatty acid esters, fatty alcohols, polyglycolized glycerides, saturated polyglycolized C8-C10 glycerides, vegetable oils and silicone oil.

[0073] Microemulsions are particularly of interest from the standpoint of drug solubilization and the enhanced absorption of drugs. Lipid based microemulsions (both o/w and w/o) have been proposed to enhance the oral bioavailability of drugs, including peptides (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385-1390; Ritschel, Meth. Find. Exp. Clin. Pharmacol., 1993, 13, 205). Microemulsions afford advantages of improved drug solubilization, protection of drug from enzymatic hydrolysis, possible enhancement of drug absorption due to surfactant-induced alterations in membrane fluidity and permeability, ease of preparation, ease of oral administration over solid dosage forms, improved clinical potency, and decreased toxicity (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385; Ho et al., J. Pharm. Sci., 1996, 85, 138-143). Often microemulsions may form spontaneously when their components are brought together at ambient temperature. This may be particularly advantageous when formulating thermolabile drugs, peptides or oligonucleotides. Microemulsions have also been effective in the transdermal delivery of active components in both cosmetic and pharmaceutical applications. It is expected that the microemulsion compositions and formulations of the present invention will facilitate the increased systemic absorption of oligonucleotides and nucleic acids from the gastrointestinal tract, as well as improve the local cellular uptake of oligonucleotides and nucleic acids within the gastrointestinal tract, vagina, buccal cavity and other areas of administration.

[0074] Microemulsions of the present invention may also contain additional components and additives such as sorbitan monostearate (Grill 3), Labrasol, and penetration enhancers to improve the properties of the formulation and to enhance the absorption of the oligonucleotides and nucleic acids of the present invention. Penetration enhancers used in the microemulsions of the present invention may be classified as belonging to one of five broad categories—surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92) Each of these classes has been discussed above.

[0075] Liposomes

[0076] There are many organized surfactant structures besides microemulsions that have been studied and used for the formulation of drugs. These include monolayers, micelles, bilayers and vesicles. Vesicles, such as liposomes, have attracted great interest because of their specificity and the duration of action they offer from the standpoint of drug delivery. As used in the present invention, the term “liposome” means a vesicle composed of amphiphilic lipids arranged in a spherical bilayer or bilayers.

[0077] Liposomes are unilamellar or multilamellar vesicles which have a membrane formed from a lipophilic material and an aqueous interior. The aqueous portion contains the composition to be delivered. Cationic liposomes possess the advantage of being able to fuse to the cell wall. Non-cationic liposomes, although not able to fuse as efficiently with the cell wall, are taken up by macrophages in vivo.

[0078] In order to cross intact mammalian skin, lipid vesicles must pass through a series of fine pores, each with a diameter less than 50 nm, under the influence of a suitable transdermal gradient. Therefore, it is desirable to use a liposome which is highly deformable and able to pass through such fine pores.

[0079] Further advantages of liposomes include; liposomes obtained from natural phospholipids are biocompatible and biodegradable; liposomes can incorporate a wide range of water and lipid soluble drugs; liposomes can protect encapsulated drugs in their internal compartments from metabolism and degradation (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Important considerations in the preparation of liposome formulations are the lipid surface charge, vesicle size and the aqueous volume of the liposomes.

[0080] Liposomes are useful for the transfer and delivery of active ingredients to the site of action. Because the liposomal membrane is structurally similar to biological membranes, when liposomes are applied to a tissue, the liposomes start to merge with the cellular membranes. As the merging of the liposome and cell progresses, the liposomal contents are emptied into the cell where the active agent may act.

[0081] Liposomal formulations have been the focus of extensive investigation as the mode of delivery for many drugs. There is growing evidence that for topical administration, liposomes present several advantages over other formulations. Such advantages include reduced side-effects related to high systemic absorption of the administered drug, increased accumulation of the administered drug at the desired target, and the ability to administer a wide variety of drugs, both hydrophilic and hydrophobic, into the skin.

[0082] Several reports have detailed the ability of liposomes to deliver agents including high-molecular weight DNA into the skin. Compounds including analgesics, antibodies, hormones and high-molecular weight DNAs have been administered to the skin. The majority of applications resulted in the targeting of the upper epidermis.

[0083] Liposomes fall into two broad classes. Cationic liposomes are positively charged liposomes which interact with the negatively charged DNA molecules to form a stable complex. The positively charged DNA/liposome complex binds to the negatively charged cell surface and is internalized in an endosome. Due to the acidic pH within the endosome, the liposomes are ruptured, releasing their contents into the cell cytoplasm (Wang et al., Biochem. Biophys. Res. Commun., 1987, 147, 980-985).

[0084] Liposomes which are pH-sensitive or negatively-charged, entrap DNA rather than complex with it. Since both the DNA and the lipid are similarly charged, repulsion rather than complex formation occurs. Nevertheless, some DNA is entrapped within the aqueous interior of these liposomes. pH-sensitive liposomes have been used to deliver DNA encoding the thymidine kinase gene to cell monolayers in culture. Expression of the exogenous gene was detected in the target cells (Zhou et al., Journal of Controlled Release, 1992, 19, 269-274).

[0085] One major type of liposomal composition includes phospholipids other than naturally-derived phosphatidylcholine. Neutral liposome compositions, for example, can be formed from dimyristoyl phosphatidylcholine (DMPC) or dipalmitoyl phosphatidylcholine (DPPC). Anionic liposome compositions generally are formed from dimyristoyl phosphatidylglycerol, while anionic fusogenic liposomes are formed primarily from dioleoyl phosphatidylethanolamine (DOPE). Another type of liposomal composition is formed from phosphatidylcholine (PC) such as, for example, soybean PC, and egg PC. Another type is formed from mixtures of phospholipid and/or phosphatidylcholine and/or cholesterol.

[0086] Several studies have assessed the topical delivery of liposomal drug formulations to the skin. Application of liposomes containing interferon to guinea pig skin resulted in a reduction of skin herpes sores while delivery of interferon via other means (e.g. as a solution or as an emulsion) were ineffective (Weiner et al., Journal of Drug Targeting, 1992, 2, 405-410). Further, an additional study tested the efficacy of interferon administered as part of a liposomal formulation to the administration of interferon using an aqueous system, and concluded that the liposomal formulation was superior to aqueous administration (du Plessis et al., Antiviral Research, 1992, 18, 259-265).

[0087] Non-ionic liposomal systems have also been examined to determine their utility in the delivery of drugs to the skin, in particular systems comprising non-ionic surfactant and cholesterol. Non-ionic liposomal formulations comprising Novasome™ I (glyceryl dilaurate/cholesterol/polyoxyethylene-10-stearyl ether) and Novasome™ II (glyceryl distearate/cholesterol/polyoxyethylene-10-stearyl ether) were used to deliver cyclosporin-A into the dermis of mouse skin. Results indicated that such non-ionic liposomal systems were effective in facilitating the deposition of cyclosporin-A into different layers of the skin (Hu et al. S.T.P.Pharma. Sci., 1994, 4, 6, 466).

[0088] Liposomes also include “sterically stabilized” liposomes, a term which, as used herein, refers to liposomes comprising one or more specialized lipids that, when incorporated into liposomes, result in enhanced circulation lifetimes relative to liposomes lacking such specialized lipids. Examples of sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome (A) comprises one or more glycolipids, such as monosialoganglioside G_(M1), or (B) is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety. While not wishing to be bound by any particular theory, it is thought in the art that, at least for sterically stabilized liposomes containing gangliosides, sphingomyelin, or PEG-derivatized lipids, the enhanced circulation half-life of these sterically stabilized liposomes derives from a reduced uptake into cells of the reticuloendothelial system (RES) (Allen et al., FEBS Letters, 1987, 223, 42; Wu et al., Cancer Research, 1993, 53, 3765).

[0089] Various liposomes comprising one or more glycolipids are known in the art. Papahadjopoulos et al. (Ann. N.Y. Acad. Sci., 1987, 507, 64) reported the ability of monosialoganglioside G_(M1), galactocerebroside sulfate and phosphatidylinositol to improve blood half-lives of liposomes. These findings were expounded upon by Gabizon et al. (Proc. Natl. Acad. Sci. U.S.A., 1988, 85, 6949). U.S. Pat. No. 4,837,028 and WO 88/04924, both to Allen et al., disclose liposomes comprising (1) sphingomyelin and (2) the ganglioside G_(M1) or a galactocerebroside sulfate ester. U.S. Pat. No. 5,543,152 (Webb et al.) discloses liposomes comprising sphingomyelin. Liposomes comprising 1,2-sn-dimyristoylphosphatidylcholine are disclosed in WO 97/13499 (Lim et al.).

[0090] Many liposomes comprising lipids derivatized with one or more hydrophilic polymers, and methods of preparation thereof, are known in the art. Sunamoto et al. (Bull. Chem. Soc. Jpn., 1980, 53, 2778) described liposomes comprising a nonionic detergent, 2C₁₂15G, that contains a PEG moiety. Illum et al. (FEBS Lett., 1984, 167, 79) noted that hydrophilic coating of polystyrene particles with polymeric glycols results in significantly enhanced blood half-lives. Synthetic phospholipids modified by the attachment of carboxylic groups of polyalkylene glycols (e.g., PEG) are described by Sears (U.S. Pat. Nos. 4,426,330 and 4,534,899). Klibanov et al. (FEBS Lett., 1990, 268, 235) described experiments demonstrating that liposomes comprising phosphatidylethanolamine (PE) derivatized with PEG or PEG stearate have significant increases in blood circulation half-lives. Blume et al. (Biochimica et Biophysica Acta, 1990, 1029, 91) extended such observations to other PEG-derivatized phospholipids, e.g., DSPE-PEG, formed from the combination of distearoylphosphatidylethanolamine (DSPE) and PEG. Liposomes having covalently bound PEG moieties on their external surface are described in European Patent No. EP 0 445 131 B1 and WO 90/04384 to Fisher. Liposome compositions containing 1-20 mole percent of PE derivatized with PEG, and methods of use thereof, are described by Woodle et al. (U.S. Pat. Nos. 5,013,556 and 5,356,633) and Martin et al. (U.S. Pat. No. 5,213,804 and European Patent No. EP 0 496 813 B1). Liposomes comprising a number of other lipid-polymer conjugates are disclosed in WO 91/05545 and U.S. Pat. No. 5,225,212 (both to Martin et al.) and in WO 94/20073 (Zalipsky et al.) Liposomes comprising PEG-modified ceramide lipids are described in WO 96/10391 (Choi et al.). U.S. Pat. Nos. 5,540,935 (Miyazaki et al.) and 5,556,948 (Tagawa et al.) describe PEG-containing liposomes that can be further derivatized with functional moieties on their surfaces.

[0091] A limited number of liposomes comprising nucleic acids are known in the art. WO 96/40062 to Thierry et al. discloses methods for encapsulating high molecular weight nucleic acids in liposomes. U.S. Pat. No. 5,264,221 to Tagawa et al. discloses protein-bonded liposomes and asserts that the contents of such liposomes may include an antisense RNA. U.S. Pat. No. 5,665,710 to Rahman et al. describes certain methods of encapsulating oligodeoxynucleotides in liposomes. WO 97/04787 to Love et al. discloses liposomes comprising antisense oligonucleotides targeted to the raf gene.

[0092] Transfersomes are yet another type of liposomes, and are highly deformable lipid aggregates which are attractive candidates for drug delivery vehicles. Transfersomes may be described as lipid droplets which are so highly deformable that they are easily able to penetrate through pores which are smaller than the droplet. Transfersomes are adaptable to the environment in which they are used, e.g. they are self-optimizing (adaptive to the shape of pores in the skin), self-repairing, frequently reach their targets without fragmenting, and often self-loading. To make transfersomes it is possible to add surface edge-activators, usually surfactants, to a standard liposomal composition. Transfersomes have been used to deliver serum albumin to the skin. The transfersome-mediated delivery of serum albumin has been shown to be as effective as subcutaneous injection of a solution containing serum albumin.

[0093] Surfactants find wide application in formulations such as emulsions (including microemulsions) and liposomes. The most common way of classifying and ranking the properties of the many different types of surfactants, both natural and synthetic, is by the use of the hydrophile/lipophile balance (HLB). The nature of the hydrophilic group (also known as the “head”) provides the most useful means for categorizing the different surfactants used in formulations (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0094] If the surfactant molecule is not ionized, it is classified as a nonionic surfactant. Nonionic surfactants find wide application in pharmaceutical and cosmetic products and are usable over a wide range of pH values. In general their HLB values range from 2 to about 18 depending on their structure. Nonionic surfactants include nonionic esters such as ethylene glycol esters, propylene glycol esters, glyceryl esters, polyglyceryl esters, sorbitan esters, sucrose esters, and ethoxylated esters. Nonionic alkanolamides and ethers such as fatty alcohol ethoxylates, propoxylated alcohols, and ethoxylated/propoxylated block polymers are also included in this class. The polyoxyethylene surfactants are the most popular members of the nonionic surfactant class.

[0095] If the surfactant molecule carries a negative charge when it is dissolved or dispersed in water, the surfactant is classified as anionic. Anionic surfactants include carboxylates such as soaps, acyl lactylates, acyl amides of amino acids, esters of sulfuric acid such as alkyl sulfates and ethoxylated alkyl sulfates, sulfonates such as alkyl benzene sulfonates, acyl isethionates, acyl taurates and sulfosuccinates, and phosphates. The most important members of the anionic surfactant class are the alkyl sulfates and the soaps.

[0096] If the surfactant molecule carries a positive charge when it is dissolved or dispersed in water, the surfactant is classified as cationic. Cationic surfactants include quaternary ammonium salts and ethoxylated amines. The quaternary ammonium salts are the most used members of this class.

[0097] If the surfactant molecule has the ability to carry either a positive or negative charge, the surfactant is classified as amphoteric. Amphoteric surfactants include acrylic acid derivatives, substituted alkylamides, N-alkylbetaines and phosphatides.

[0098] The use of surfactants in drug products, formulations and in emulsions has been reviewed (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0099] Penetration Enhancers

[0100] In one embodiment, the present invention employs various penetration enhancers to effect the efficient delivery of nucleic acids, particularly oligonucleotides, to the skin of animals. Most drugs are present in solution in both ionized and nonionized forms. However, usually only lipid soluble or lipophilic drugs readily cross cell membranes. It has been discovered that even non-lipophilic drugs may cross cell membranes if the membrane to be crossed is treated with a penetration enhancer. In addition to aiding the diffusion of non-lipophilic drugs across cell membranes, penetration enhancers also enhance the permeability of lipophilic drugs.

[0101] Penetration enhancers may be classified as belonging to one of five broad categories, i.e., surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92). Each of the above mentioned classes of penetration enhancers are described below in greater detail.

[0102] Surfactants: In connection with the present invention, surfactants (or “surface-active agents”) are chemical entities which, when dissolved in an aqueous solution, reduce the surface tension of the solution or the interfacial tension between the aqueous solution and another liquid, with the result that absorption of oligonucleotides through the mucosa is enhanced. In addition to bile salts and fatty acids, these penetration enhancers include, for example, sodium lauryl sulfate, polyoxyethylene-9-lauryl ether and polyoxyethylene-20-cetyl ether) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92); and perfluorochemical emulsions, such as FC-43. Takahashi et al., J. Pharm. Pharmacol., 1988, 40, 252).

[0103] Fatty acids: Various fatty acids and their derivatives which act as penetration enhancers include, for example, oleic acid, lauric acid, capric acid (n-decanoic acid), myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein (1-monooleoyl-rac-glycerol), dilaurin, caprylic acid, arachidonic acid, glycerol 1-monocaprate, 1-dodecylazacycloheptan-2-one, acylcarnitines, acylcholines, C₁₋₁₀ alkyl esters thereof (e.g., methyl, isopropyl and t-butyl), and mono- and di-glycerides thereof (i.e., oleate, laurate, caprate, myristate, palmitate, stearate, linoleate, etc.) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; El Hariri et al., J. Pharm. Pharmacol., 1992, 44, 651-654).

[0104] Bile salts: The physiological role of bile includes the facilitation of dispersion and absorption of lipids and fat-soluble vitamins (Brunton, Chapter 38 in: Goodman & Gilman's The Pharmacological Basis of Therapeutics, 9th Ed., Hardman et al. Eds., McGraw-Hill, New York, 1996, pp. 934-935). Various natural bile salts, and their synthetic derivatives, act as penetration enhancers. Thus the term “bile salts” includes any of the naturally occurring components of bile as well as any of their synthetic derivatives. The bile salts of the invention include, for example, cholic acid (or its pharmaceutically acceptable sodium salt, sodium cholate), dehydrocholic acid (sodium dehydrocholate), deoxycholic acid (sodium deoxycholate), glucholic acid (sodium glucholate), glycholic acid (sodium glycocholate), glycodeoxycholic acid (sodium glycodeoxycholate), taurocholic acid (sodium taurocholate), taurodeoxycholic acid (sodium taurodeoxycholate), chenodeoxycholic acid (sodium chenodeoxycholate), ursodeoxycholic acid (UDCA), sodium tauro-24,25-dihydro-fusidate (STDHF), sodium glycodihydrofusidate and polyoxyethylene-9-lauryl ether (POE) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Swinyard, Chapter 39 In: Remington's Pharmaceutical Sciences, 18th Ed., Gennaro, ed., Mack Publishing Co., Easton, Pa., 1990, pages 782-783; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Yamamoto et al., J. Pharm. Exp. Ther., 1992, 263, 25; Yamashita et al., J. Pharm. Sci., 1990, 79, 579-583).

[0105] Chelating Agents: Chelating agents, as used in connection with the present invention, can be defined as compounds that remove metallic ions from solution by forming complexes therewith, with the result that absorption of oligonucleotides through the mucosa is enhanced. With regards to their use as penetration enhancers in the present invention, chelating agents have the added advantage of also serving as DNase inhibitors, as most characterized DNA nucleases require a divalent metal ion for catalysis and are thus inhibited by chelating agents (Jarrett, J. Chromatogr., 1993, 618, 315-339). Chelating agents of the invention include but are not limited to disodium ethylenediaminetetraacetate (EDTA), citric acid, salicylates (e.g., sodium salicylate, 5-methoxysalicylate and homovanilate), N-acyl derivatives of collagen, laureth-9 and N-amino acyl derivatives of beta-diketones (enamines)(Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Buur et al., J. Control Rel., 1990, 14, 43-51).

[0106] Non-chelating non-surfactants: As used herein, non-chelating non-surfactant penetration enhancing compounds can be defined as compounds that demonstrate insignificant activity as chelating agents or as surfactants but that nonetheless enhance absorption of oligonucleotides through the alimentary mucosa (Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33). This class of penetration enhancers include, for example, unsaturated cyclic ureas, 1-alkyl- and 1-alkenylazacyclo-alkanone derivatives (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92); and non-steroidal anti-inflammatory agents such as diclofenac sodium, indomethacin and phenylbutazone (Yamashita et al., J. Pharm. Pharmacol., 1987, 39, 621-626).

[0107] Agents that enhance uptake of oligonucleotides at the cellular level may also be added to the pharmaceutical and other compositions of the present invention. For example, cationic lipids, such as lipofectin (Junichi et al, U.S. Pat. No. 5,705,188), cationic glycerol derivatives, and polycationic molecules, such as polylysine (Lollo et al., PCT Application WO 97/30731), are also known to enhance the cellular uptake of oligonucleotides.

[0108] Other agents may be utilized to enhance the penetration of the administered nucleic acids, including glycols such as ethylene glycol and propylene glycol, pyrrols such as 2-pyrrol, azones, and terpenes such as limonene and menthone.

[0109] Carriers

[0110] Certain compositions of the present invention also incorporate carrier compounds in the formulation. As used herein, “carrier compound” or “carrier” can refer to a nucleic acid, or analog thereof, which is inert (i.e., does not possess biological activity per se) but is recognized as a nucleic acid by in vivo processes that reduce the bioavailability of a nucleic acid having biological activity by, for example, degrading the biologically active nucleic acid or promoting its removal from circulation. The coadministration of a nucleic acid and a carrier compound, typically with an excess of the latter substance, can result in a substantial reduction of the amount of nucleic acid recovered in the liver, kidney or other extracirculatory reservoirs, presumably due to competition between the carrier compound and the nucleic acid for a common receptor. For example, the recovery of a partially phosphorothioate oligonucleotide in hepatic tissue can be reduced when it is coadministered with polyinosinic acid, dextran sulfate, polycytidic acid or 4-acetamido-4′isothiocyano-stilbene-2,2′-disulfonic acid (Miyao et al., Antisense Res. Dev., 1995, 5, 115-121; Takakura et al., Antisense & Nucl. Acid Drug Dev., 1996, 6, 177-183).

[0111] Excipients

[0112] In contrast to a carrier compound, a “pharmaceutical carrier” or “excipient” is a pharmaceutically acceptable solvent, suspending agent or any other pharmacologically inert vehicle for delivering one or more nucleic acids to an animal. The excipient may be liquid or solid and is selected, with the planned manner of administration in mind, so as to provide for the desired bulk, consistency, etc., when combined with a nucleic acid and the other components of a given pharmaceutical composition. Typical pharmaceutical carriers include, but are not limited to, binding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose, etc.); fillers (e.g., lactose and other sugars, microcrystalline cellulose, pectin, gelatin, calcium sulfate, ethyl cellulose, polyacrylates or calcium hydrogen phosphate, etc.); lubricants (e.g., magnesium stearate, talc, silica, colloidal silicon dioxide, stearic acid, metallic stearates, hydrogenated vegetable oils, corn starch, polyethylene glycols, sodium benzoate, sodium acetate, etc.); disintegrants (e.g., starch, sodium starch glycolate, etc.); and wetting agents (e.g., sodium lauryl sulphate, etc.).

[0113] Pharmaceutically acceptable organic or inorganic excipient suitable for non-parenteral administration which do not deleteriously react with nucleic acids can also be used to formulate the compositions of the present invention. Suitable pharmaceutically acceptable carriers include, but are not limited to, water, salt solutions, alcohols, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0114] Formulations for topical administration of nucleic acids may include sterile and non-sterile aqueous solutions, non-aqueous solutions in common solvents such as alcohols, or solutions of the nucleic acids in liquid or solid oil bases. The solutions may also contain buffers, diluents and other suitable additives. Pharmaceutically acceptable organic or inorganic excipients suitable for non-parenteral administration which do not deleteriously react with nucleic acids can be used.

[0115] Suitable pharmaceutically acceptable excipients include, but are not limited to, water, salt solutions, alcohol, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0116] Other Components

[0117] The compositions of the present invention may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels. Thus, for example, the compositions may contain additional, compatible, pharmaceutically-active materials such as, for example, antipruritics, astringents, local anesthetics or anti-inflammatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers. However, such materials, when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention. The formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.

[0118] Aqueous suspensions may contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0119] Certain embodiments of the invention provide pharmaceutical compositions containing (a) one or more antisense compounds and (b) one or more other chemotherapeutic agents which function by a non-antisense mechanism. Examples of such chemotherapeutic agents include but are not limited to daunorubicin, daunomycin, dactinomycin, doxorubicin, epirubicin, idarubicin, esorubicin, bleomycin, mafosfamide, ifosfamide, cytosine arabinoside, bis-chloroethylnitrosurea, busulfan, mitomycin C, actinomycin D, mithramycin, prednisone, hydroxyprogesterone, testosterone, tamoxifen, dacarbazine, procarbazine, hexamethylmelamine, pentamethylmelamine, mitoxantrone, amsacrine, chlorambucil, methylcyclohexylnitrosurea, nitrogen mustards, melphalan, cyclophosphamide, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-azacytidine, hydroxyurea, deoxycoformycin, 4-hydroxyperoxycyclophosphoramide, 5-fluorouracil (5-FU), 5-fluorodeoxyuridine (5-FUdR), methotrexate (MTX), colchicine, taxol, vincristine, vinblastine, etoposide (VP-16), trimetrexate, irinotecan, topotecan, gemcitabine, teniposide, cisplatin and diethylstilbestrol (DES). See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed. 1987, pp. 1206-1228, Berkow et al., eds., Rahway, N.J. When used with the compounds of the invention, such chemotherapeutic agents may be used individually (e.g., 5-FU and oligonucleotide), sequentially (e.g., 5-FU and oligonucleotide for a period of time followed by MTX and oligonucleotide), or in combination with one or more other such chemotherapeutic agents (e.g., 5-FU, MTX and oligonucleotide, or 5-FU, radiotherapy and oligonucleotide). Anti-inflammatory drugs, including but not limited to nonsteroidal anti-inflammatory drugs and corticosteroids, and antiviral drugs, including but not limited to ribivirin, vidarabine, acyclovir and ganciclovir, may also be combined in compositions of the invention. See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed., Berkow et al., eds., 1987, Rahway, N.J., pages 2499-2506 and 46-49, respectively). Other non-antisense chemotherapeutic agents are also within the scope of this invention. Two or more combined compounds may be used together or sequentially.

[0120] In another related embodiment, compositions of the invention may contain one or more antisense compounds, particularly oligonucleotides, targeted to a first nucleic acid and one or more additional antisense compounds targeted to a second nucleic acid target. Numerous examples of antisense compounds are known in the art. Two or more combined compounds may be used together or sequentially.

[0121] The formulation of therapeutic compositions and their subsequent administration is believed to be within the skill of those in the art. Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC₅₀s found to be effective in in vitro and in vivo animal models. In general, dosage is from 0.01 ug to 100 g per kg of body weight, and may be given once or more daily, weekly, monthly or yearly, or even once every 2 to 20 years. Persons of ordinary skill in the art can easily estimate repetition rates for dosing based on measured residence times and concentrations of the drug in bodily fluids or tissues. Following successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 ug to 100 g per kg of body weight, once or more daily, to once every 20 years.

[0122] While the present invention has been described with specificity in accordance with certain of its preferred embodiments, the following examples serve only to illustrate the invention and are not intended to limit the same.

EXAMPLES Example 1

[0123] Nucleoside Phosphoramidites for Oligonucleotide Synthesis Deoxy and 2′-alkoxy amidites

[0124] 2′-Deoxy and 2′-methoxy beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial sources (e.g. Chemgenes, Needham Mass. or Glen Research, Inc. Sterling Va.). Other 2′-O-alkoxy substituted nucleoside amidites are prepared as described in U.S. Pat. No. 5,506,351, herein incorporated by reference. For oligonucleotides synthesized using 2′-alkoxy amidites, the standard cycle for unmodified oligonucleotides was utilized, except the wait step after pulse delivery of tetrazole and base was increased to 360 seconds.

[0125] Oligonucleotides containing 5-methyl-2′-deoxycytidine (5-Me-C) nucleotides were synthesized according to published methods [Sanghvi, et. al., Nucleic Acids Research, 1993, 21, 3197-3203] using commercially available phosphoramidites (Glen Research, Sterling Va. or ChemGenes, Needham Mass.).

[0126] 2′-Fluoro amidites

[0127] 2′-Fluorodeoxyadenosine amidites

[0128] 2′-fluoro oligonucleotides were synthesized as described previously [Kawasaki, et. al., J. Med. Chem., 1993, 36, 831-841] and U.S. Pat. No. 5,670,633, herein incorporated by reference. Briefly, the protected nucleoside N6-benzoyl-2′-deoxy-2′-fluoroadenosine was synthesized utilizing commercially available 9-beta-D-arabinofuranosyladenine as starting material and by modifying literature procedures whereby the 2′-alpha-fluoro atom is introduced by a S_(N)2-displacement of a 2′-beta-trityl group. Thus N6-benzoyl-9-beta-D-arabinofuranosyladenine was selectively protected in moderate yield as the 3′,5′-ditetrahydropyranyl (THP) intermediate. Deprotection of the THP and N6-benzoyl groups was accomplished using standard methodologies and standard methods were used to obtain the 5′-dimethoxytrityl-(DMT) and 5′-DMT-3′-phosphoramidite intermediates.

[0129] 2′-Fluorodeoxyguanosine

[0130] The synthesis of 2′-deoxy-2′-fluoroguanosine was accomplished using tetraisopropyldisiloxanyl (TPDS) protected 9-beta-D-arabinofuranosylguanine as starting material, and conversion to the intermediate diisobutyryl-arabinofuranosylguanosine. Deprotection of the TPDS group was followed by protection of the hydroxyl group with THP to give diisobutyryl di-THP protected arabinofuranosylguanine. Selective O-deacylation and triflation was followed by treatment of the crude product with fluoride, then deprotection of the THP groups. Standard methodologies were used to obtain the 5′-DMT- and 5′-DMT-3′-phosphoramidites.

[0131] 2′-Fluorouridine

[0132] Synthesis of 2′-deoxy-2′-fluorouridine was accomplished by the modification of a literature procedure in which 2,2′-anhydro-1-beta-D-arabinofuranosyluracil was treated with 70% hydrogen fluoride-pyridine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′phosphoramidites.

[0133] 2′-Fluorodeoxycytidine

[0134] 2′-deoxy-2′-fluorocytidine was synthesized via amination of 2′-deoxy-2′-fluorouridine, followed by selective protection to give N4-benzoyl-2′-deoxy-2′-fluorocytidine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′phosphoramidites.

[0135] 2′-O-(2-Methoxyethyl) modified amidites

[0136] 2′-O-Methoxyethyl-substituted nucleoside amidites are prepared as follows, or alternatively, as per the methods of Martin, P., Helvetica Chimica Acta, 1995, 78, 486-504.

[0137] 2,2′-Anhydro[1-(beta-D-arabinofuranosyl)-5-methyluridine]

[0138] 5-Methyluridine (ribosylthymine, commercially available through Yamasa, Choshi, Japan) (72.0 g , 0.279 M), diphenyl-carbonate (90.0 g, 0.420 M) and sodium bicarbonate (2.0 g , 0.024 M) were added to DMF (300 mL). The mixture was heated to reflux, with stirring, allowing the evolved carbon dioxide gas to be released in a controlled manner. After 1 hour, the slightly darkened solution was concentrated under reduced pressure. The resulting syrup was poured into diethylether (2.5 L), with stirring. The product formed a gum. The ether was decanted and the residue was dissolved in a minimum amount of methanol (ca. 400 mL). The solution was poured into fresh ether (2.5 L) to yield a stiff gum. The ether was decanted and the gum was dried in a vacuum oven (60° C. at 1 mm Hg for 24 h) to give a solid that was crushed to a light tan powder (57 g, 85% crude yield). The NMR spectrum was consistent with the structure, contaminated with phenol as its sodium salt (ca. 5%). The material was used as is for further reactions (or it can be purified further by column chromatography using a gradient of methanol in ethyl acetate (10-25%) to give a white solid, mp 222-4° C.).

[0139] 2′-O-Methoxyethyl-5-methyluridine

[0140] 2,2′-Anhydro-5-methyluridine (195 g, 0.81 M), tris(2-methoxyethyl)borate (231 g, 0.98 M) and 2-methoxyethanol (1.2 L) were added to a 2 L stainless steel pressure vessel and placed in a pre-heated oil bath at 160° C. After heating for 48 hours at 155-160° C., the vessel was opened and the solution evaporated to dryness and triturated with MeOH (200 mL). The residue was suspended in hot acetone (1 L). The insoluble salts were filtered, washed with acetone (150 mL) and the filtrate evaporated. The residue (280 g) was dissolved in CH₃CN (600 mL) and evaporated. A silica gel column (3 kg) was packed in CH₂Cl₂/acetone/MeOH (20:5:3) containing 0.5% Et₃NH. The residue was dissolved in CH₂Cl₂ (250 mL) and adsorbed onto silica (150 g) prior to loading onto the column. The product was eluted with the packing solvent to give 160 g (63%) of product. Additional material was obtained by reworking impure fractions.

[0141] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine

[0142] 2′-O-Methoxyethyl-5-methyluridine (160 g, 0.506 M) was co-evaporated with pyridine (250 mL) and the dried residue dissolved in pyridine (1.3 L). A first aliquot of dimethoxytrityl chloride (94.3 g, 0.278 M) was added and the mixture stirred at room temperature for one hour. A second aliquot of dimethoxytrityl chloride (94.3 g, 0.278 M) was added and the reaction stirred for an additional one hour. Methanol (170 mL) was then added to stop the reaction. HPLC showed the presence of approximately 70% product. The solvent was evaporated and triturated with CH₃CN (200 mL). The residue was dissolved in CHCl₃ (1.5 L) and extracted with 2×500 mL of saturated NaHCO₃ and 2×500 mL of saturated NaCl. The organic phase was dried over Na₂SO₄, filtered and evaporated. 275 g of residue was obtained. The residue was purified on a 3.5 kg silica gel column, packed and eluted with EtOAc/hexane/acetone (5:5:1) containing 0.5% Et₃NH. The pure fractions were evaporated to give 164 g of product. Approximately 20 g additional was obtained from the impure fractions to give a total yield of 183 g (57%).

[0143] 3′-O-Acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine

[0144] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine (106 g, 0.167 M), DMF/pyridine (750 mL of a 3:1 mixture prepared from 562 mL of DMF and 188 mL of pyridine) and acetic anhydride (24.38 mL, 0.258 M) were combined and stirred at room temperature for 24 hours. The reaction was monitored by TLC by first quenching the TLC sample with the addition of MeOH. Upon completion of the reaction, as judged by TLC, MeOH (50 mL) was added and the mixture evaporated at 35° C. The residue was dissolved in CHCl₃ (800 mL) and extracted with 2×200 mL of saturated sodium bicarbonate and 2×200 mL of saturated NaCl. The water layers were back extracted with 200 mL of CHCl₃. The combined organics were dried with sodium sulfate and evaporated to give 122 g of residue (approx. 90% product). The residue was purified on a 3.5 kg silica gel column and eluted using EtOAc/hexane(4:1). Pure product fractions were evaporated to yield 96 g (84%). An additional 1.5 g was recovered from later fractions.

[0145] 3′-O-Acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyl-4-triazoleuridine

[0146] A first solution was prepared by dissolving 3′-O-acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine (96 g, 0.144 M) in CH₃CN (700 mL) and set aside. Triethylamine (189 mL, 1.44 M) was added to a solution of triazole (90 g, 1.3 M) in CH₃CN (1 L), cooled to −5° C. and stirred for 0.5 h using an overhead stirrer. POCl₃ was added dropwise, over a 30 minute period, to the stirred solution maintained at 0-10° C., and the resulting mixture stirred for an additional 2 hours. The first solution was added dropwise, over a 45 minute period, to the latter solution. The resulting reaction mixture was stored overnight in a cold room. Salts were filtered from the reaction mixture and the solution was evaporated. The residue was dissolved in EtOAc (1 L) and the insoluble solids were removed by filtration. The filtrate was washed with 1×300 mL of NaHCO₃ and 2×300 mL of saturated NaCl, dried over sodium sulfate and evaporated. The residue was triturated with EtOAc to give the title compound.

[0147] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine

[0148] A solution of 3′-O-acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyl-4-triazoleuridine (103 g, 0.141 M) in dioxane (500 mL) and NH₄OH (30 mL) was stirred at room temperature for 2 hours. The dioxane solution was evaporated and the residue azeotroped with MeOH (2×200 mL). The residue was dissolved in MeOH (300 mL) and transferred to a 2 liter stainless steel pressure vessel. MeOH (400 mL) saturated with NH₃ g as was added and the vessel heated to 100° C. for 2 hours (TLC showed complete conversion). The vessel contents were evaporated to dryness and the residue was dissolved in EtOAc (500 mL) and washed once with saturated NaCl (200 mL). The organics were dried over sodium sulfate and the solvent was evaporated to give 85 g (95%) of the title compound.

[0149] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine

[0150] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine (85 g, 0.134 M) was dissolved in DMF (800 mL) and benzoic anhydride (37.2 g, 0.165 M) was added with stirring. After stirring for 3 hours, TLC showed the reaction to be approximately 95% complete. The solvent was evaporated and the residue azeotroped with MeOH (200 mL). The residue was dissolved in CHCl₃ (700 mL) and extracted with saturated NaHCO₃ (2×300 mL) and saturated NaCl (2×300 mL), dried over MgSO₄ and evaporated to give a residue (96 g). The residue was chromatographed on a 1.5 kg silica column using EtOAc/hexane (1:1) containing 0.5% Et₃NH as the eluting solvent. The pure product fractions were evaporated to give 90 g (90%) of the title compound.

[0151] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine-3′-amidite

[0152] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine (74 g, 0.10 M) was dissolved in CH₂Cl₂ (1 L). Tetrazole diisopropylamine (7.1 g ) and 2-cyanoethoxy-tetra-(isopropyl)phosphite (40.5 mL, 0.123 M) were added with stirring, under a nitrogen atmosphere. The resulting mixture was stirred for 20 hours at room temperature (TLC showed the reaction to be 95% complete). The reaction mixture was extracted with saturated NaHCO₃ (1×300 mL) and saturated NaCl (3×300 mL). The aqueous washes were back-extracted with CH₂Cl₂ (300 mL), and the extracts were combined, dried over MgSO₄ and concentrated. The residue obtained was chromatographed on a 1.5 kg silica column using EtOAc/hexane (3:1) as the eluting solvent. The pure fractions were combined to give 90.6 g (87%) of the title compound.

[0153] 2′-O-(Aminooxyethyl) nucleoside amidites and 2′-O-(dimethylaminooxyethyl) nucleoside amidites

[0154] 2′-(Dimethylaminooxyethoxy) nucleoside amidites

[0155] 2′-(Dimethylaminooxyethoxy) nucleoside amidites [also known in the art as 2′-O-(dimethylaminooxyethyl) nucleoside amidites] are prepared as described in the following paragraphs. Adenosine, cytidine and guanosine nucleoside amidites are prepared similarly to the thymidine (5-methyluridine) except the exocyclic amines are protected with a benzoyl moiety in the case of adenosine and cytidine and with isobutyryl in the case of guanosine.

[0156] 5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine

[0157] O²-2′-anhydro-5-methyluridine (Pro. Bio. Sint., Varese, Italy, 100.0 g, 0.416 mmol), dimethylaminopyridine (0.66 g, 0.013 eq, 0.0054 mmol) were dissolved in dry pyridine (500 ml) at ambient temperature under an argon atmosphere and with mechanical stirring. tert-Butyldiphenylchlorosilane (125.8 g, 119.0 mL, 1.1 eq, 0.458 mmol) was added in one portion. The reaction was stirred for 16 h at ambient temperature. TLC (Rf 0.22, ethyl acetate) indicated a complete reaction. The solution was concentrated under reduced pressure to a thick oil. This was partitioned between dichloromethane (1 L) and saturated sodium bicarbonate (2×1 L) and brine (1 L). The organic layer was dried over sodium sulfate and concentrated under reduced pressure to a thick oil. The oil was dissolved in a 1:1 mixture of ethyl acetate and ethyl ether (600 mL) and the solution was cooled to −10° C. The resulting crystalline product was collected by filtration, washed with ethyl ether (3×200 mL) and dried (40° C., 1 mm Hg, 24 h) to 149 g (74.8%) of white solid. TLC and NMR were consistent with pure product.

[0158] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine

[0159] In a 2 L stainless steel, unstirred pressure reactor was added borane in tetrahydrofuran (1.0 M, 2.0 eq, 622 mL). In the fume hood and with manual stirring, ethylene glycol (350 mL, excess) was added cautiously at first until the evolution of hydrogen gas subsided. 5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine (149 g, 0.311 mol) and sodium bicarbonate (0.074 g, 0.003 eq) were added with manual stirring. The reactor was sealed and heated in an oil bath until an internal temperature of 160 ° C. was reached and then maintained for 16 h (pressure<100 psig). The reaction vessel was cooled to ambient and opened. TLC (Rf 0.67 for desired product and Rf 0.82 for ara-T side product, ethyl acetate) indicated about 70% conversion to the product. In order to avoid additional side product formation, the reaction was stopped, concentrated under reduced pressure (10 to 1 mm Hg) in a warm water bath (40-100° C.) with the more extreme conditions used to remove the ethylene glycol. [Alternatively, once the low boiling solvent is gone, the remaining solution can be partitioned between ethyl acetate and water. The product will be in the organic phase.] The residue was purified by column chromatography (2 kg silica gel, ethyl acetate-hexanes gradient 1:1 to 4:1). The appropriate fractions were combined, stripped and dried to product as a white crisp foam (84 g, 50%), contaminated starting material (17.4 g) and pure reusable starting material 20 g. The yield based on starting material less pure recovered starting material was 58%. TLC and NMR were consistent with 99% pure product.

[0160] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine

[0161] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine (20 g, 36.98 mmol) was mixed with triphenylphosphine (11.63 g, 44.36 mmol) and N-hydroxyphthalimide (7.24 g, 44.36 mmol). It was then dried over P₂O₅ under high vacuum for two days at 40° C. The reaction mixture was flushed with argon and dry THF (369.8 mL, Aldrich, sure seal bottle) was added to get a clear solution. Diethyl-azodicarboxylate (6.98 mL, 44.36 mmol) was added dropwise to the reaction mixture. The rate of addition is maintained such that resulting deep red coloration is just discharged before adding the next drop. After the addition was complete, the reaction was stirred for 4 hrs. By that time TLC showed the completion of the reaction (ethylacetate:hexane, 60:40). The solvent was evaporated in vacuum. Residue obtained was placed on a flash column and eluted with ethyl acetate:hexane (60:40), to get 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine as white foam (21.819 g, 86%).

[0162] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine

[0163] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine (3.1 g, 4.5 mmol) was dissolved in dry CH₂Cl₂ (4.5 mL) and methylhydrazine (300 mL, 4.64 mmol) was added dropwise at −10° C. to 0° C. After 1 h the mixture was filtered, the filtrate was washed with ice cold CH₂Cl₂ and the combined organic phase was washed with water, brine and dried over anhydrous Na₂SO₄. The solution was concentrated to get 2′-O-(aminooxyethyl) thymidine, which was then dissolved in MeOH (67.5 mL). To this formaldehyde (20% aqueous solution, w/w, 1.1 eq.) was added and the resulting mixture was strirred for 1 h. Solvent was removed under vacuum; residue chromatographed to get 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy) ethyl]-5-methyluridine as white foam (1.95 g, 78%).

[0164] 5′-O-tert-Butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine

[0165] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine (1.77 g, 3.12 mmol) was dissolved in a solution of 1M pyridinium p-toluenesulfonate (PPTS) in dry MeOH (30.6 mL). Sodium cyanoborohydride (0.39 g, 6.13 mmol) was added to this solution at 10° C. under inert atmosphere. The reaction mixture was stirred for 10 minutes at 10° C. After that the reaction vessel was removed from the ice bath and stirred at room temperature for 2 h, the reaction monitored by TLC (5% MeOH in CH₂Cl₂). Aqueous NaHCO₃ solution (5%, 10 mL) was added and extracted with ethyl acetate (2×20 mL). Ethyl acetate phase was dried over anhydrous Na₂SO₄, evaporated to dryness. Residue was dissolved in a solution of 1M PPTS in MeOH (30.6 mL). Formaldehyde (20% w/w, 30 mL, 3.37 mmol) was added and the reaction mixture was stirred at room temperature for 10 minutes. Reaction mixture cooled to 10° C. in an ice bath, sodium cyanoborohydride (0.39 g, 6.13 mmol) was added and reaction mixture stirred at 10° C. for 10 minutes. After 10 minutes, the reaction mixture was removed from the ice bath and stirred at room temperature for 2 hrs. To the reaction mixture 5% NaHCO₃ (25 mL) solution was added and extracted with ethyl acetate (2×25 mL). Ethyl acetate layer was dried over anhydrous Na₂SO₄ and evaporated to dryness . The residue obtained was purified by flash column chromatography and eluted with 5% MeOH in CH₂Cl₂ to get 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine as a white foam (14.6 g, 80%).

[0166] 2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0167] Triethylamine trihydrofluoride (3.9 mL, 24.0 mmol) was dissolved in dry THF and triethylamine (1.67 mL, 12 mmol, dry, kept over KOH). This mixture of triethylamine-2HF was then added to 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine (1.40 g, 2.4 mmol) and stirred at room temperature for 24 hrs. Reaction was monitored by TLC (5% MeOH in CH₂Cl₂). Solvent was removed under vacuum and the residue placed on a flash column and eluted with 10% MeOH in CH₂Cl₂ to get 2′-O-(dimethylaminooxyethyl)-5-methyluridine (766 mg, 92.5%).

[0168] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0169] 2′-O-(dimethylaminooxyethyl)-5-methyluridine (750 mg, 2.17 mmol) was dried over P₂O₅ under high vacuum overnight at 40° C. It was then co-evaporated with anhydrous pyridine (20 mL). The residue obtained was dissolved in pyridine (11 mL) under argon atmosphere. 4-dimethylaminopyridine (26.5 mg, 2.60 mmol), 4,4′-dimethoxytrityl chloride (880 mg, 2.60 mmol) was added to the mixture and the reaction mixture was stirred at room temperature until all of the starting material disappeared. Pyridine was removed under vacuum and the residue chromatographed and eluted with 10% MeOH in CH₂Cl₂ (containing a few drops of pyridine) to get 5′-O-DMT-2′-O-(dimethylamino-oxyethyl)-5-methyluridine (1.13 g, 80%).

[0170] 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0171] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine (1.08 g, 1.67 mmol) was co-evaporated with toluene (20 mL). To the residue N,N-diisopropylamine tetrazonide (0.29 g, 1.67 mmol) was added and dried over P₂O₅ under high vacuum overnight at 40° C. Then the reaction mixture was dissolved in anhydrous acetonitrile (8.4 mL) and 2-cyanoethyl-N,N,N¹,N¹-tetraisopropylphosphoramidite (2.12 mL, 6.08 mmol) was added. The reaction mixture was stirred at ambient temperature for 4 hrs under inert atmosphere. The progress of the reaction was monitored by TLC (hexane:ethyl acetate 1:1). The solvent was evaporated, then the residue was dissolved in ethyl acetate (70 mL) and washed with 5% aqueous NaHCO₃ (40 mL). Ethyl acetate layer was dried over anhydrous Na₂SO₄ and concentrated. Residue obtained was chromatographed (ethyl acetate as eluent) to get 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite] as a foam (1.04 g, 74.9%).

[0172] 2′-(Aminooxyethoxy) nucleoside amidites

[0173] 2′-(Aminooxyethoxy) nucleoside amidites [also known in the art as 2′-O-(aminooxyethyl) nucleoside amidites] are prepared as described in the following paragraphs. Adenosine, cytidine and thymidine nucleoside amidites are prepared similarly.

[0174] N2-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl) -5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N, N-diisopropylphosphoramidite]

[0175] The 2′-O-aminooxyethyl guanosine analog may be obtained by selective 2-O-alkylation of diaminopurine riboside. Multigram quantities of diaminopurine riboside may be purchased from Schering AG (Berlin) to provide 2′-O-(2-ethylacetyl) diaminopurine riboside along with a minor amount of the 3′-O-isomer. 2′-O-(2-ethylacetyl) diaminopurine riboside may be resolved and converted to 2′-O-(2-ethylacetyl)guanosine by treatment with adenosine deaminase. (McGee, D. P. C., Cook, P. D., Guinosso, C. J., WO 94/02501 A1 940203.) Standard protection procedures should afford 2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl) guanosine and 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl) -5′-O-(4,4′-dimethoxytrityl)guanosine which may be reduced to provide 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-hydroxyethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine. As before the hydroxyl group may be displaced by N-hydroxyphthalimide via a Mitsunobu reaction, and the protected nucleoside may phosphitylated as usual to yield 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-([2-phthalmidoxy]ethyl) -5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite].

[0176] 2′-dimethylaminoethoxyethoxy (2′-DMAEOE) nucleoside amidites

[0177] 2′-dimethylaminoethoxyethoxy nucleoside amidites (also known in the art as 2′-O-dimethylaminoethoxyethyl, i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, or 2′-DMAEOE nucleoside amidites) are prepared as follows. Other nucleoside amidites are prepared similarly.

[0178] 2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl]-5-methyl uridine

[0179] 2[2-(Dimethylamino)ethoxy]ethanol (Aldrich, 6.66 g, 50 mmol) is slowly added to a solution of borane in tetra-hydrofuran (1 M, 10 mL, 10 mmol) with stirring in a 100 mL bomb. Hydrogen gas evolves as the solid dissolves. O²-,2′-anhydro-5-methyluridine (1.2 g, 5 mmol), and sodium bicarbonate (2.5 mg) are added and the bomb is sealed, placed in an oil bath and heated to 155° C. for 26 hours. The bomb is cooled to room temperature and opened. The crude solution is concentrated and the residue partitioned between water (200 mL) and hexanes (200 mL). The excess phenol is extracted into the hexane layer. The aqueous layer is extracted with ethyl acetate (3×200 mL) and the combined organic layers are washed once with water, dried over anhydrous sodium sulfate and concentrated. The residue is columned on silica gel using methanol/methylene chloride 1:20 (which has 2% triethylamine) as the eluent. As the column fractions are concentrated a colorless solid forms which is collected to give the title compound as a white solid.

[0180] 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy) ethyl)]-5-methyl uridine

[0181] To 0.5 g (1.3 mmol) of 2′-O-[2(2-N,N-dimethylamino-ethoxy)ethyl)]-5-methyl uridine in anhydrous pyridine (8 mL), triethylamine (0.36 mL) and dimethoxytrityl chloride (DMT-Cl, 0.87 g, 2 eq.) are added and stirred for 1 hour. The reaction mixture is poured into water (200 mL) and extracted with CH₂Cl₂ (2×200 mL). The combined CH₂Cl₂ layers are washed with saturated NaHCO₃ solution, followed by saturated NaCl solution and dried over anhydrous sodium sulfate. Evaporation of the solvent followed by silica gel chromatography using MeOH:CH₂Cl₂:Et₃N (20:1, v/v, with 1% triethylamine) gives the title compound.

[0182] 5′-O-Dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)-ethyl)]-5-methyl uridine-3′-O-(cyanoethyl-N,N-diisopropyl)phosphoramidite

[0183] Diisopropylaminotetrazolide (0.6 g) and 2-cyanoethoxy-N,N-diisopropyl phosphoramidite (1.1 mL, 2 eq.) are added to a solution of 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl)]-5-methyluridine (2.17 g, 3 mmol) dissolved in CH₂Cl₂ (20 mL) under an atmosphere of argon. The reaction mixture is stirred overnight and the solvent evaporated. The resulting residue is purified by silica gel flash column chromatography with ethyl acetate as the eluent to give the title compound.

Example 2

[0184] Oligonucleotide Synthesis

[0185] Unsubstituted and substituted phosphodiester (P═O) oligonucleotides are synthesized on an automated DNA synthesizer (Applied Biosystems model 380B) using standard phosphoramidite chemistry with oxidation by iodine.

[0186] Phosphorothioates (P═S) are synthesized as for the phosphodiester oligonucleotides except the standard oxidation bottle was replaced by 0.2 M solution of 3H-1,2-benzodithiole-3-one 1,1-dioxide in acetonitrile for the stepwise thiation of the phosphite linkages. The thiation wait step was increased to 68 sec and was followed by the capping step. After cleavage from the CPG column and deblocking in concentrated ammonium hydroxide at 55° C. (18 h), the oligonucleotides were purified by precipitating twice with 2.5 volumes of ethanol from a 0.5 M NaCl solution.

[0187] Phosphinate oligonucleotides are prepared as described in U.S. Pat. No. 5,508,270, herein incorporated by reference.

[0188] Alkyl phosphonate oligonucleotides are prepared as described in U.S. Pat. No. 4,469,863, herein incorporated by reference.

[0189] 3′-Deoxy-3′-methylene phosphonate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050, herein incorporated by reference.

[0190] Phosphoramidite oligonucleotides are prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 5,366,878, herein incorporated by reference.

[0191] Alkylphosphonothioate oligonucleotides are prepared as described in published PCT applications PCT/US94/00902 and PCT/US93/06976 (published as WO 94/17093 and WO 94/02499, respectively), herein incorporated by reference.

[0192] 3′-Deoxy-3′-amino phosphoramidate oligonucleotides are prepared as described in U.S. Pat. No. 5,476,925, herein incorporated by reference.

[0193] Phosphotriester oligonucleotides are prepared as described in U.S. Pat. No. 5,023,243, herein incorporated by reference.

[0194] Borano phosphate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198, both herein incorporated by reference.

Example 3

[0195] Oligonucleoside Synthesis

[0196] Methylenemethylimino linked oligonucleosides, also identified as MMI linked oligonucleosides, methylenedimethyl-hydrazo linked oligonucleosides, also identified as MDH linked oligonucleosides, and methylenecarbonylamino linked oligonucleosides, also identified as amide-3 linked oligonucleosides, and methyleneaminocarbonyl linked oligo-nucleosides, also identified as amide-4 linked oligonucleo-sides, as well as mixed backbone compounds having, for instance, alternating MMI and P═O or P═S linkages are prepared as described in U.S. Pat. Nos. 5,378,825, 5,386,023, 5,489,677, 5,602,240 and 5,610,289, all of which are herein incorporated by reference.

[0197] Formacetal and thioformacetal linked oligonucleosides are prepared as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, herein incorporated by reference.

[0198] Ethylene oxide linked oligonucleosides are prepared as described in U.S. Pat. No. 5,223,618, herein incorporated by reference.

Example 4

[0199] PNA Synthesis

[0200] Peptide nucleic acids (PNAs) are prepared in accordance with any of the various procedures referred to in Peptide Nucleic Acids (PNA): Synthesis, Properties and Potential Applications, Bioorganic & Medicinal Chemistry, 1996, 4, 5-23. They may also be prepared in accordance with U.S. Pat. Nos. 5,539,082, 5,700,922, and 5,719,262, herein incorporated by reference.

Example 5

[0201] Synthesis of Chimeric Oligonucleotides

[0202] Chimeric oligonucleotides, oligonucleosides or mixed oligonucleotides/oligonucleosides of the invention can be of several different types. These include a first type wherein the “gap” segment of linked nucleosides is positioned between 5′ and 3′ “wing” segments of linked nucleosides and a second “open end” type wherein the “gap” segment is located at either the 3′ or the 5′ terminus of the oligomeric compound. Oligonucleotides of the first type are also known in the art as “gapmers” or gapped oligonucleotides. Oligonucleotides of the second type are also known in the art as “hemimers” or “wingmers”.

[0203] [2′-O-Me]-[2′-deoxy]-[2′-O-Me] Chimeric Phosphorothioate Oligonucleotides

[0204] Chimeric oligonucleotides having 2′-O-alkyl phosphorothioate and 2′-deoxy phosphorothioate oligo-nucleotide segments are synthesized using an Applied Biosystems automated DNA synthesizer Model 380B, as above. Oligonucleotides are synthesized using the automated synthesizer and 2′-deoxy-5′-dimethoxytrityl-3′-O-phosphor-amidite for the DNA portion and 5′-dimethoxytrityl-2′-O-methyl-3′-O-phosphoramidite for 5′ and 3′ wings. The standard synthesis cycle is modified by increasing the wait step after the delivery of tetrazole and base to 600 s repeated four times for RNA and twice for 2′-O-methyl. The fully protected oligonucleotide is cleaved from the support and the phosphate group is deprotected in 3:1 ammonia/ethanol at room temperature overnight then lyophilized to dryness. Treatment in methanolic ammonia for 24 hrs at room temperature is then done to deprotect all bases and sample was again lyophilized to dryness. The pellet is resuspended in 1M TBAF in THF for 24 hrs at room temperature to deprotect the 2′ positions. The reaction is then quenched with 1M TEAA and the sample is then reduced to ½ volume by rotovac before being desalted on a G25 size exclusion column. The oligo recovered is then analyzed spectrophotometrically for yield and for purity by capillary electrophoresis and by mass spectrometry.

[0205] [2′-O-(2-Methoxyethyl)]-[2′-deoxy]-[2′-O-(Methoxyethyl)] Chimeric Phosphorothioate Oligonucleotides

[0206] [2′-O-(2-methoxyethyl)]-[2′-deoxy]-[-2′-O-(methoxy-ethyl)] chimeric phosphorothioate oligonucleotides were prepared as per the procedure above for the 2′-O-methyl chimeric oligonucleotide, with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites.

[0207] [2′-O-(2-Methoxyethyl)Phosphodiester]-[2′-deoxy Phosphorothioate]-[2′-O-(2-Methoxyethyl) Phosphodiester] Chimeric Oligonucleotides

[0208] [2′-O-(2-methoxyethyl phosphodiester]-[2′-deoxy phos-phorothioate]-[2′-O-(methoxyethyl) phosphodiester] chimeric oligonucleotides are prepared as per the above procedure for the 2′-O-methyl chimeric oligonucleotide with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites, oxidization with iodine to generate the phosphodiester internucleotide linkages within the wing portions of the chimeric structures and sulfurization utilizing 3,H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) to generate the phosphorothioate internucleotide linkages for the center gap.

[0209] Other chimeric oligonucleotides, chimeric oligonucleo-sides and mixed chimeric oligonucleotides/oligonucleosides are synthesized according to U.S. Pat. No. 5,623,065, herein incorporated by reference.

Example 6

[0210] Oligonucleotide Isolation

[0211] After cleavage from the controlled pore glass column (Applied Biosystems) and deblocking in concentrated ammonium hydroxide at 55° C. for 18 hours, the oligonucleotides or oligonucleosides are purified by precipitation twice out of 0.5 M NaCl with 2.5 volumes ethanol. Synthesized oligonucleotides were analyzed by polyacrylamide gel electrophoresis on denaturing gels and judged to be at least 85% full length material. The relative amounts of phosphorothioate and phosphodiester linkages obtained in synthesis were periodically checked by ³¹P nuclear magnetic resonance spectroscopy, and for some studies oligonucleotides were purified by HPLC, as described by Chiang et al., J. Biol. Chem. 1991, 266, 18162-18171. Results obtained with HPLC-purified material were similar to those obtained with non-HPLC purified material.

Example 7

[0212] Oligonucleotide Synthesis—96Well Plate Format

[0213] Oligonucleotides were synthesized via solid phase P(III) phosphoramidite chemistry on an automated synthesizer capable of assembling 96 sequences simultaneously in a standard 96 well format. Phosphodiester internucleotide linkages were afforded by oxidation with aqueous iodine. Phosphorothioate internucleotide linkages were generated by sulfurization utilizing 3,H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) in anhydrous acetonitrile. Standard base-protected beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial vendors (e.g. PE-Applied Biosystems, Foster City, Calif., or Pharmacia, Piscataway, N.J.). Non-standard nucleosides are synthesized as per known literature or patented methods. They are utilized as base protected beta-cyanoethyldiisopropyl phosphoramidites.

[0214] Oligonucleotides were cleaved from support and deprotected with concentrated NH₄OH at elevated temperature (55-60° C.) for 12-16 hours and the released product then dried in vacuo. The dried product was then re-suspended in sterile water to afford a master plate from which all analytical and test plate samples are then diluted utilizing robotic pipettors.

Example 8

[0215] Oligonucleotide Analysis—96Well Plate Format

[0216] The concentration of oligonucleotide in each well was assessed by dilution of samples and UV absorption spectroscopy. The full-length integrity of the individual products was evaluated by capillary electrophoresis (CE) in either the 96 well format (Beckman P/ACE™ MDQ) or, for individually prepared samples, on a commercial CE apparatus (e.g., Beckman P/ACE™ 5000, ABI 270). Base and backbone composition was confirmed by mass analysis of the compounds utilizing electrospray-mass spectroscopy. All assay test plates were diluted from the master plate using single and multi-channel robotic pipettors. Plates were judged to be acceptable if at least 85% of the compounds on the plate were at least 85% full length.

Example 9

[0217] Cell Culture and Oligonucleotide Treatment

[0218] The effect of antisense compounds on target nucleic acid expression can be tested in any of a variety of cell types provided that the target nucleic acid is present at measurable levels. This can be routinely determined using, for example, PCR or Northern blot analysis. The following 5 cell types are provided for illustrative purposes, but other cell types can be routinely used, provided that the target is expressed in the cell type chosen. This can be readily determined by methods routine in the art, for example Northern blot analysis, Ribonuclease protection assays, or RT-PCR.

[0219] T-24 Cells:

[0220] The human transitional cell bladder carcinoma cell line T-24 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). T-24 cells were routinely cultured in complete McCoy's 5A basal media (Gibco/Life Technologies, Gaithersburg, Md.) supplemented with 10% fetal calf serum (Gibco/Life Technologies, Gaithersburg, Md.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Gibco/Life Technologies, Gaithersburg, Md.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence. Cells were seeded into 96-well plates (Falcon-Primaria #3872) at a density of 7000 cells/well for use in RT-PCR analysis.

[0221] For Northern blotting or other analysis, cells may be seeded onto 100 mm or other standard tissue culture plates and treated similarly, using appropriate volumes of medium and oligonucleotide.

[0222] A549 Cells:

[0223] The human lung carcinoma cell line A549 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). A549 cells were routinely cultured in DMEM basal media (Gibco/Life Technologies, Gaithersburg, Md.) supplemented with 10% fetal calf serum (Gibco/Life Technologies, Gaithersburg, Md.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Gibco/Life Technologies, Gaithersburg, Md.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence.

[0224] NHDF Cells:

[0225] Human neonatal dermal fibroblast (NHDF) were obtained from the Clonetics Corporation (Walkersville Md.). NHDFs were routinely maintained in Fibroblast Growth Medium (Clonetics Corporation, Walkersville Md.) supplemented as recommended by the supplier. Cells were maintained for up to 10 passages as recommended by the supplier.

[0226] HEK Cells:

[0227] Human embryonic keratinocytes (HEK) were obtained from the Clonetics Corporation (Walkersville Md.). HEKs were routinely maintained in Keratinocyte Growth Medium (Clonetics Corporation, Walkersville Md.) formulated as recommended by the supplier. Cells were routinely maintained for up to 10 passages as recommended by the supplier.

[0228] HepG2 Cells:

[0229] The human hepatoblastoma cell line HepG2 was obtained from the American Type Culture Collection (Manassas, Va.). HepG2 cells were routinely cultured in Eagle's MEM supplemented with 10% fetal calf serum, non-essential amino acids, and 1 mM sodium pyruvate (Gibco/Life Technologies, Gaithersburg, Md.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence. Cells were seeded into 96-well plates (Falcon-Primaria #3872) at a density of 7000 cells/well for use in RT-PCR analysis.

[0230] For Northern blotting or other analyses, cells may be seeded onto 100 mm or other standard tissue culture plates and treated similarly, using appropriate volumes of medium and oligonucleotide.

[0231] Treatment with Antisense Compounds:

[0232] When cells reached 80% confluency, they were treated with oligonucleotide. For cells grown in 96-well plates, wells were washed once with 200 μL OPTI-MEM™-1 reduced-serum medium (Gibco BRL) and then treated with 130 μL of OPTI-MEM™-1 containing 3.75 μg/mL LIPOFECTIN™ (Gibco BRL) and the desired concentration of oligonucleotide. After 4-7 hours of treatment, the medium was replaced with fresh medium. Cells were harvested 16-24 hours after oligonucleotide treatment.

[0233] The concentration of oligonucleotide used varies from cell line to cell line. To determine the optimal oligonucleotide concentration for a particular cell line, the cells are treated with a positive control oligonucleotide at a range of concentrations. For human cells the positive control oligonucleotide is ISIS 13920, TCCGTCATCGCTCCTCAGGG, SEQ ID NO: 1, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to human H-ras. For mouse or rat cells the positive control oligonucleotide is ISIS 15770, ATGCATTCTGCCCCCAAGGA, SEQ ID NO: 2, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to both mouse and rat c-raf. The concentration of positive control oligonucleotide that results in 80% inhibition of c-Ha-ras (for ISIS 13920) or c-raf (for ISIS 15770) mRNA is then utilized as the screening concentration for new oligonucleotides in subsequent experiments for that cell line. If 80% inhibition is not achieved, the lowest concentration of positive control oligonucleotide that results in 60% inhibition of H-ras or c-raf mRNA is then utilized as the oligonucleotide screening concentration in subsequent experiments for that cell line. If 60% inhibition is not achieved, that particular cell line is deemed as unsuitable for oligonucleotide transfection experiments.

Example 10

[0234] Analysis of Oligonucleotide Inhibition of FXR Expression

[0235] Antisense modulation of FXR expression can be assayed in a variety of ways known in the art. For example, FXR mRNA levels can be quantitated by, e.g., Northern blot analysis, competitive polymerase chain reaction (PCR), or real-time PCR (RT-PCR). Real-time quantitative PCR is presently preferred. RNA analysis can be performed on total cellular RNA or poly(A)+mRNA. Methods of RNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.1.1-4.2.9 and 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Northern blot analysis is routine in the art and is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.2.1-4.2.9, John Wiley & Sons, Inc., 1996. Real-time quantitative (PCR) can be conveniently accomplished using the commercially available ABI PRISM™ 7700 Sequence Detection System, available from PE-Applied Biosystems, Foster City, Calif. and used according to manufacturer's instructions.

[0236] Protein levels of FXR can be quantitated in a variety of ways well known in the art, such as immunoprecipitation, Western blot analysis (immunoblotting), ELISA or fluorescence-activated cell sorting (FACS). Antibodies directed to FXR can be identified and obtained from a variety of sources, such as the MSRS catalog of antibodies (Aerie Corporation, Birmingham, Mich.), or can be prepared via conventional antibody generation methods. Methods for preparation of polyclonal antisera are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.12.1-11.12.9, John Wiley & Sons, Inc., 1997. Preparation of monoclonal antibodies is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.4.1-11.11.5, John Wiley & Sons, Inc., 1997.

[0237] Immunoprecipitation methods are standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 10.16.1-10.16.11, John Wiley & Sons, Inc., 1998. Western blot (immunoblot) analysis is standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 10.8.1-10.8.21, John Wiley & Sons, Inc., 1997. Enzyme-linked immunosorbent assays (ELISA) are standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.2.1-11.2.22, John Wiley & Sons, Inc., 1991.

Example 11

[0238] Poly(A)+ mRNA Isolation

[0239] Poly(A)+ mRNA was isolated according to Miura et al., Clin. Chem., 1996, 42, 1758-1764. Other methods for poly(A)+ mRNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 60 μL lysis buffer (10 mM Tris-HCl, pH 7.6, 1 mM EDTA, 0.5 M NaCl, 0.5% NP-40, 20 mM vanadyl-ribonucleoside complex) was added to each well, the plate was gently agitated and then incubated at room temperature for five minutes. 55 μL of lysate was transferred to Oligo d(T) coated 96-well plates (AGCT Inc., Irvine Calif.). Plates were incubated for 60 minutes at room temperature, washed 3 times with 200 μL of wash buffer (10 mM Tris-HCl pH 7.6, 1 mM EDTA, 0.3 M NaCl). After the final wash, the plate was blotted on paper towels to remove excess wash buffer and then air-dried for 5 minutes. 60 μL of elution buffer (5 mM Tris-HCl pH 7.6), preheated to 70° C. was added to each well, the plate was incubated on a 90° C. hot plate for 5 minutes, and the eluate was then transferred to a fresh 96-well plate.

[0240] Cells grown on 100 mm or other standard plates may be treated similarly, using appropriate volumes of all solutions.

Example 12

[0241] Total RNA Isolation

[0242] Total RNA was isolated using an RNEASY 96™ kit and buffers purchased from Qiagen Inc. (Valencia Calif.) following the manufacturer's recommended procedures. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 100 μL Buffer RLT was added to each well and the plate vigorously agitated for 20 seconds. 100 μL of 70% ethanol was then added to each well and the contents mixed by pipetting three times up and down. The samples were then transferred to the RNEASY 96™ well plate attached to a QIAVAC™ manifold fitted with a waste collection tray and attached to a vacuum source. Vacuum was applied for 15 seconds. 1 mL of Buffer RW1 was added to each well of the RNEASY 96™ plate and the vacuum again applied for 15 seconds. 1 mL of Buffer RPE was then added to each well of the RNEASY 96™ plate and the vacuum applied for a period of 15 seconds. The Buffer RPE wash was then repeated and the vacuum was applied for an additional 10 minutes. The plate was then removed from the QIAVAC™ manifold and blotted dry on paper towels. The plate was then re-attached to the QIAVAC™ manifold fitted with a collection tube rack containing 1.2 mL collection tubes. RNA was then eluted by pipetting 60 μL water into each well, incubating 1 minute, and then applying the vacuum for 30 seconds. The elution step was repeated with an additional 60 μL water.

[0243] The repetitive pipetting and elution steps may be automated using a QIAGEN Bio-Robot 9604 (Qiagen, Inc., Valencia Calif.). Essentially, after lysing of the cells on the culture plate, the plate is transferred to the robot deck where the pipetting, DNase treatment and elution steps are carried out.

Example 13

[0244] Real-Time Quantitative PCR Analysis of FXR mRNA Levels

[0245] Quantitation of FXR mRNA levels was determined by real-time quantitative PCR using the ABI PRISM™ 7700 Sequence Detection System (PE-Applied Biosystems, Foster City, Calif.) according to manufacturer's instructions. This is a closed-tube, non-gel-based, fluorescence detection system which allows high-throughput quantitation of polymerase chain reaction (PCR) products in real-time. As opposed to standard PCR, in which amplification products are quantitated after the PCR is completed, products in real-time quantitative PCR are quantitated as they accumulate. This is accomplished by including in the PCR reaction an oligonucleotide probe that anneals specifically between the forward and reverse PCR primers, and contains two fluorescent dyes. A reporter dye (e.g., JOE, FAM, or VIC, obtained from either Operon Technologies Inc., Alameda, Calif. or PE-Applied Biosystems, Foster City, Calif.) is attached to the 5′ end of the probe and a quencher dye (e.g., TAMRA, obtained from either Operon Technologies Inc., Alameda, Calif. or PE-Applied Biosystems, Foster City, Cailf.) is attached to the 3′ end of the probe. When the probe and dyes are intact, reporter dye emission is quenched by the proximity of the 3′ quencher dye. During amplification, annealing of the probe to the target sequence creates a substrate that can be cleaved by the 5′-exonuclease activity of Taq polymerase. During the extension phase of the PCR amplification cycle, cleavage of the probe by Taq polymerase releases the reporter dye from the remainder of the probe (and hence from the quencher moiety) and a sequence-specific fluorescent signal is generated. With each cycle, additional reporter dye molecules are cleaved from their respective probes, and the fluorescence intensity is monitored at regular intervals by laser optics built into the ABI PRISM™ 7700 Sequence Detection System. In each assay, a series of parallel reactions containing serial dilutions of mRNA from untreated control samples generates a standard curve that is used to quantitate the percent inhibition after antisense oligonucleotide treatment of test samples.

[0246] Prior to quantitative PCR analysis, primer-probe sets specific to the target gene being measured are evaluated for their ability to be “multiplexed” with a GAPDH amplification reaction. In multiplexing, both the target gene and the internal standard gene GAPDH are amplified concurrently in a single sample. In this analysis, mRNA isolated from untreated cells is serially diluted. Each dilution is amplified in the presence of primer-probe sets specific for GAPDH only, target gene only (“single-plexing”), or both (multiplexing). Following PCR amplification, standard curves of GAPDH and target mRNA signal as a function of dilution are generated from both the single-plexed and multiplexed samples. If both the slope and correlation coefficient of the GAPDH and target signals generated from the multiplexed samples fall within 10% of their corresponding values generated from the single-plexed samples, the primer-probe set specific for that target is deemed multiplexable. Other methods of PCR are also known in the art.

[0247] PCR reagents were obtained from PE-Applied Biosystems, Foster City, Calif. RT-PCR reactions were carried out by adding 25 μL PCR cocktail (1×TAQMAN™ buffer A, 5.5 mM MgCl₂, 300 μM each of dATP, dCTP and dGTP, 600 μM of dUTP, 100 nM each of forward primer, reverse primer, and probe, 20 Units RNAse inhibitor, 1.25 Units AMPLITAQ GOLD™, and 12.5 Units MuLV reverse transcriptase) to 96 well plates containing 25 μL total RNA solution. The RT reaction was carried out by incubation for 30 minutes at 48° C. Following a 10 minute incubation at 95° C. to activate the AMPLITAQ GOLD™, 40 cycles of a two-step PCR protocol were carried out: 95° C. for 15 seconds (denaturation) followed by 60° C. for 1.5 minutes (annealing/extension).

[0248] Gene target quantities obtained by real time RT-PCR are normalized using either the expression level of GAPDH, a gene whose expression is constant, or by quantifying total RNA using RiboGreen™ (Molecular Probes, Inc. Eugene, Oreg.). GAPDH expression is quantified by real time RT-PCR, by being run simultaneously with the target, multiplexing, or separately. Total RNA is quantified using RiboGreen™ RNA quantification reagent from Molecular Probes. Methods of RNA quantification by RiboGreen™ are taught in Jones, L. J., et al, Analytical Biochemistry, 1998, 265, 368-374.

[0249] In this assay, 175 μL of RiboGreen™ working reagent (RiboGreen™ reagent diluted 1:2865 in 10 mM Tris-HCl, 1 mM EDTA, pH 7.5) is pipetted into a 96-well plate containing 25 uL purified, cellular RNA. The plate is read in a CytoFluor 4000 (PE Applied Biosystems) with excitation at 480 nm and emission at 520 nm.

[0250] Probes and primers to human FXR were designed to hybridize to a human FXR sequence, using published sequence information (GenBank accession number U68233.1, incorporated herein as SEQ ID NO: 3). For human FXR the PCR primers were: forward primer: TCAGTGTAAATCTAAGCGACTGAGAAA (SEQ ID NO: 4) reverse primer: GCAAGTCACGACCTTCACTGTCT (SEQ ID NO: 5) and the PCR probe was: FAM-AGCAGCATGCAGATCAGACCGTGAATG-TAMRA (SEQ ID NO: 6) where FAM (PE-Applied Biosystems, Foster City, Calif.) is the fluorescent reporter dye) and TAMRA (PE-Applied Biosystems, Foster City, Calif.) is the quencher dye. For human GAPDH the PCR primers were: forward primer: GAAGGTGAAGGTCGGAGTC(SEQ ID NO: 7) reverse primer: GAAGATGGTGATGGGATTTC (SEQ ID NO: 8) and the PCR probe was: 5′ JOE-CAAGCTTCCCGTTCTCAGCC-TAMRA 3′ (SEQ ID NO: 9) where JOE (PE-Applied Biosystems, Foster City, Calif.) is the fluorescent reporter dye) and TAMRA (PE-Applied Biosystems, Foster City, Calif.) is the quencher dye.

Example 14

[0251] Northern Blot Analysis of FXR mRNA Levels

[0252] Eighteen hours after antisense treatment, cell monolayers were washed twice with cold PBS and lysed in 1 mL RNAZOL™ (TEL-TEST “B” Inc., Friendswood, Tex.). Total RNA was prepared following manufacturer's recommended protocols. Twenty micrograms of total RNA was fractionated by electrophoresis through 1.2% agarose gels containing 1.1% formaldehyde using a MOPS buffer system (AMRESCO, Inc. Solon, Ohio). RNA was transferred from the gel to HYBOND™-N+ nylon membranes (Amersham Pharmacia Biotech, Piscataway, N.J.) by overnight capillary transfer using a Northern/Southern Transfer buffer system (TEL-TEST “B” Inc., Friendswood, Tex.). RNA transfer was confirmed by UV visualization. Membranes were fixed by UV cross-linking using a STRATALINKER™ UV Crosslinker 2400 (Stratagene, Inc, La Jolla, Calif.) and then probed using QUICKHYB™ hybridization solution (Stratagene, La Jolla, Calif.) using manufacturer's recommendations for stringent conditions.

[0253] To detect human FXR, a human FXR specific probe was prepared by PCR using the forward primer TCAGTGTAAATCTAAGCGACTGAGAAA (SEQ ID NO: 4) and the reverse primer GCAAGTCACGACCTTCACTGTCT (SEQ ID NO: 5). To normalize for variations in loading and transfer efficiency membranes were stripped and probed for human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) RNA (Clontech, Palo Alto, Calif.).

[0254] Hybridized membranes were visualized and quantitated using a PHOSPHORIMAGER™ and IMAGEQUANT™ Software V3.3 (Molecular Dynamics, Sunnyvale, Calif.). Data was normalized to GAPDH levels in untreated controls.

Example 15

[0255] Antisense Inhibition of Human FXR Expression by Chimeric Phosphorothioate Oligonucleotides Having 2′-MOE Wings and a Deoxy Gap

[0256] In accordance with the present invention, a series of oligonucleotides were designed to target different regions of the human FXR RNA, using published sequences (GenBank accession number U68233.1, incorporated herein as SEQ ID NO: 3, and residues 22001-113000 of GenBank accession number AC010200.7, incorporated herein as SEQ ID NO: 10). The oligonucleotides are shown in Table 1. “Target site” indicates the first (5′-most) nucleotide number on the particular target sequence to which the oligonucleotide binds. All compounds in Table 1 are chimeric oligonucleotides (“gapmers”) 20 nucleotides in length, composed of a central “gap” region consisting of ten 2′-deoxynucleotides, which is flanked on both sides (5′ and 3′ directions) by five-nucleotide “wings”. The wings are composed of 2′-methoxyethyl (2′-MOE)nucleotides. The internucleoside (backbone) linkages are phosphorothioate (P═S) throughout the oligonucleotide. All cytidine residues are 5-methylcytidines. The compounds were analyzed for their effect on human FXR mRNA levels by quantitative real-time PCR as described in other examples herein. Data are averages from two experiments. If present, “N.D.” indicates “no data”. TABLE 1 Inhibition of human FXR mRNA levels by chimeric phosphorothioate oligonucleotides having 2′-MOE wings and a deoxy gap TARGET TARGET SEQ ID ISIS # REGION SEQ ID NO SITE SEQUENCE % INHIB NO 126448 5′UTR 3 146 cttggattgttttgggtcag 57 11 126457 Coding 3 549 aaacccaggttggaataata 14 12 126465 Coding 3 717 acacacagctcatccccttt 32 13 126471 Coding 3 861 cgcatgtacatatccatcac 77 14 126474 Coding 3 917 cagccaacattcccatctct 44 15 126476 Coding 3 946 tttacactgaatttcagtta 35 16 126479 Coding 3 981 gcatgctgcttcacattttt 91 17 126483 Coding 3 1067 gttcagttttctccctgcat 84 18 126487 Coding 3 1191 atgagaaaattttcttctgc 64 19 126492 Coding 3 1293 agcaaagcaatctggtcttc 41 20 126494 Coding 3 1354 aagtttcttattgaaaatct 0 21 126501 Coding 3 1542 gcctctctatcctttatgta 38 22 126506 Coding 3 1688 tcagcatctcagcgtggtga 22 23 145287 Intron 10 30209 gggaaacatccttggatttc 11 24 3 145288 Intron 10 51690 tgtacaagaactgcatgctg 39 25 4 145289 Intron 10 71293 tagctttggacattaacaac 59 26 9 145290 Intron 10 77281 agattatctcctttctctga 58 27 9 145291 Intron 10 86836 ggcatttacttaatttaagc 59 28 9 145292 5′UTR 3 44 atccaatttcgcattaggat 16 29 145293 5′UTR 3 75 gatcccagcgattttgctac 64 30 145294 5′UTR 3 102 ggattctggactgagtcttc 26 31 145295 5′UTR 3 115 caaggccctgggaggattct 65 32 145296 5′UTR 3 133 gggtcagagatggactttca 30 33 145297 5′UTR 3 156 cttctacctccttggattgt 70 34 145298 5′UTR 3 259 ttgaggaaatgtccagaaga 35 35 145299 5′UTR 3 316 atgcactttctttatggtgg 42 36 145300 5′UTR 3 320 tgaaatgcactttctttatg 44 37 145301 Start 3 345 tttgatcccatccaaatttt 30 38 Codon 145302 Coding 3 534 taataggatgacgaggaaat 66 39 145303 Coding 3 553 gtagaaacccaggttggaat 65 40 145304 Coding 3 587 ttccaggagagtaccactct 79 41 145305 Coding 3 596 gttcatatattccaggagag 30 42 145306 Coding 3 609 ggcatacgcctgagttcata 0 43 145307 Coding 3 645 gctacctcagtttctccctg 56 44 145308 Coding 3 699 ttgatcctccctgctgacgc 31 45 145309 Coding 3 726 ccacaaacaacacacagctc 0 46 145310 Coding 3 734 ctctgtctccacaaacaaca 0 47 145311 Coding 3 745 gtatccagaggctctgtctc 66 48 145312 Coding 3 751 atagtggtatccagaggctc 59 49 145313 Coding 3 757 tgcattatagtggtatccag 65 50 145314 Coding 3 931 agttaacaagcattcagcca 30 51 145315 Coding 3 1033 ggtcacttgtcgcaagtcac 69 52 145316 Coding 3 1039 tgtcgaggtcacttgtcgca 56 53 145317 Coding 3 1065 tcagttttctccctgcatga 70 54 145318 Coding 3 1134 tgaggcatcctctgtttgtt 66 55 145319 Coding 3 1143 gttatttcctgaggcatcct 70 56 145320 Coding 3 1178 cttctgcactgaattcttct 31 57 145321 Coding 3 1213 attggttgccatttccgtca 51 58 145322 Coding 3 1264 agtctgaaatcctggtagct 34 59 145323 Coding 3 1332 gctgaacgaaggaacatagc 5 60 145324 Coding 3 1342 gaaaatctcagctgaacgaa 0 61 145325 Coding 3 1364 gcccagacggaagtttctta 11 62 145326 Coding 3 1390 aattctttcttccaataggt 17 63 145327 Coding 3 1411 atcagagataccactatttc 51 64 145328 Coding 3 1429 cataggtgttatatattcat 0 65 145329 Coding 3 1493 ctgtaagcagagcatactcc 37 66 145330 Coding 3 1525 gtattgtctatctggagaca 53 67 145331 Coding 3 1567 aagtggctcctgaagcttct 69 68 145332 Coding 3 1608 tcaggctggtgaatcttaca 78 69 145333 Coding 3 1623 aagtgttgaggattttcagg 54 70 145334 Coding 3 1630 acaggcaaagtgttgaggat 68 71 145335 Coding 3 1654 taattcagtcaggcgaccca 71 72 145336 Coding 3 1672 gtgatgattgaatgtccgta 50 73 145337 Coding 3 1692 gacatcagcatctcagcgtg 73 74 145338 Stop 3 1761 aatccccatcactgcacgtc 45 75 Codon 145339 3′UTR 3 1789 agaaaaaggagctagacccc 18 76 145340 3′UTR 3 1812 tatacatcagattaatatga 0 77 145341 3′UTR 3 1819 ggaaagttatacatcagatt 65 78 145342 3′UTR 3 1839 ctgggtacaagtgaaataaa 0 79 145343 3′UTR 3 1847 gagtgaaactgggtacaagt 69 80 145344 3′UTR 3 1868 aaatattcatcaagatttct 28 81 145345 3′UTR 3 1893 gaagttacacatgtaattac 54 82 145346 3′UTR 3 1953 agccttttaaaacacaatgt 37 83 145347 3′UTR 3 2036 cctagttcaacagatagaat 57 84 145348 3′UTR 3 2041 ttttccctagttcaacagat 60 85 145349 3′UTR 3 2051 caaaatgagattttccctag 7 86 145350 3′UTR 3 2100 agattgaatacaactcttta 59 87 145351 3′UTR 3 2117 tgtttgctttattgccaaga 49 88

[0257] As shown in Table 1, SEQ ID NOs 11, 13, 14, 15, 16, 17, 18, 19, 20, 22, 25, 26, 27, 28, 30, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 44, 45, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 64, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 78, 80, 82, 83, 84, 85, 87 and 88 demonstrated at least 30% inhibition of human FXR expression in this assay and are therefore preferred. The target sites to which these preferred sequences are complementary are herein referred to as “active sites” and are therefore preferred sites for targeting by compounds of the present invention.

Example 16

[0258] Western Blot Analysis of FXR Protein Levels

[0259] Western blot analysis (immunoblot analysis) is carried out using standard methods. Cells are harvested 16-20 h after oligonucleotide treatment, washed once with PBS, suspended in Laemmli buffer (100 ul/well), boiled for 5 minutes and loaded on a 16% SDS-PAGE gel. Gels are run for 1.5 hours at 150 V, and transferred to membrane for western blotting. Appropriate primary antibody directed to FXR is used, with a radiolabelled or fluorescently labeled secondary antibody directed against the primary antibody species. Bands are visualized using a PHOSPHORIMAGER™ (Molecular Dynamics, Sunnyvale Calif.).

1 88 1 20 DNA Artificial Sequence Antisense Oligonucleotide 1 tccgtcatcg ctcctcaggg 20 2 20 DNA Artificial Sequence Antisense Oligonucleotide 2 atgcattctg cccccaagga 20 3 2218 DNA Homo sapiens CDS (354)...(1772) 3 acgagactct ctcctcctcc tcacctcatt gtctccccga cttatcctaa tgcgaaattg 60 gattctgagc atttgtagca aaatcgctgg gatctggaga ggaagactca gtccagaatc 120 ctcccagggc cttgaaagtc catctctgac ccaaaacaat ccaaggaggt agaagacatc 180 gtagaaggag tgaaagaaga aaagaagact tagaaacata gctcaaagtg aacactgctt 240 ctcttagttt cctggatttc ttctggacat ttcctcaaga tgaaacttca gacactttgg 300 agtttttttt gaagaccacc ataaagaaag tgcatttcaa ttgaaaaatt tgg atg 356 Met 1 gga tca aaa atg aat ctc att gaa cat tcc cat tta cct acc aca gat 404 Gly Ser Lys Met Asn Leu Ile Glu His Ser His Leu Pro Thr Thr Asp 5 10 15 gaa ttt tct ttt tct gaa aat tta ttt ggt gtt tta aca gaa caa gtg 452 Glu Phe Ser Phe Ser Glu Asn Leu Phe Gly Val Leu Thr Glu Gln Val 20 25 30 gca ggt cct ctg gga cag aac ctg gaa gtg gaa cca tac tcg caa tac 500 Ala Gly Pro Leu Gly Gln Asn Leu Glu Val Glu Pro Tyr Ser Gln Tyr 35 40 45 agc aat gtt cag ttt ccc caa gtt caa cca cag att tcc tcg tca tcc 548 Ser Asn Val Gln Phe Pro Gln Val Gln Pro Gln Ile Ser Ser Ser Ser 50 55 60 65 tat tat tcc aac ctg ggt ttc tac ccc cag cag cct gaa gag tgg tac 596 Tyr Tyr Ser Asn Leu Gly Phe Tyr Pro Gln Gln Pro Glu Glu Trp Tyr 70 75 80 tct cct gga ata tat gaa ctc agg cgt atg cca gct gag act ctc tac 644 Ser Pro Gly Ile Tyr Glu Leu Arg Arg Met Pro Ala Glu Thr Leu Tyr 85 90 95 cag gga gaa act gag gta gca gag atg cct gta aca aag aag ccc cgc 692 Gln Gly Glu Thr Glu Val Ala Glu Met Pro Val Thr Lys Lys Pro Arg 100 105 110 atg ggc gcg tca gca ggg agg atc aaa ggg gat gag ctg tgt gtt gtt 740 Met Gly Ala Ser Ala Gly Arg Ile Lys Gly Asp Glu Leu Cys Val Val 115 120 125 tgt gga gac aga gcc tct gga tac cac tat aat gca ctg acc tgt gag 788 Cys Gly Asp Arg Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr Cys Glu 130 135 140 145 ggg tgt aaa ggt ttc ttc agg aga agc att acc aaa aac gct gtg tac 836 Gly Cys Lys Gly Phe Phe Arg Arg Ser Ile Thr Lys Asn Ala Val Tyr 150 155 160 aag tgt aaa aac ggg ggc aac tgt gtg atg gat atg tac atg cga aga 884 Lys Cys Lys Asn Gly Gly Asn Cys Val Met Asp Met Tyr Met Arg Arg 165 170 175 aag tgt caa gag tgt cga cta agg aaa tgc aaa gag atg gga atg ttg 932 Lys Cys Gln Glu Cys Arg Leu Arg Lys Cys Lys Glu Met Gly Met Leu 180 185 190 gct gaa tgc ttg tta act gaa att cag tgt aaa tct aag cga ctg aga 980 Ala Glu Cys Leu Leu Thr Glu Ile Gln Cys Lys Ser Lys Arg Leu Arg 195 200 205 aaa aat gtg aag cag cat gca gat cag acc gtg aat gaa gac agt gaa 1028 Lys Asn Val Lys Gln His Ala Asp Gln Thr Val Asn Glu Asp Ser Glu 210 215 220 225 ggt cgt gac ttg cga caa gtg acc tcg aca aca aag tca tgc agg gag 1076 Gly Arg Asp Leu Arg Gln Val Thr Ser Thr Thr Lys Ser Cys Arg Glu 230 235 240 aaa act gaa ctc acc cca gat caa cag act ctt cta cat ttt att atg 1124 Lys Thr Glu Leu Thr Pro Asp Gln Gln Thr Leu Leu His Phe Ile Met 245 250 255 gat tca tat aac aaa cag agg atg cct cag gaa ata aca aat aaa att 1172 Asp Ser Tyr Asn Lys Gln Arg Met Pro Gln Glu Ile Thr Asn Lys Ile 260 265 270 tta aaa gaa gaa ttc agt gca gaa gaa aat ttt ctc att ttg acg gaa 1220 Leu Lys Glu Glu Phe Ser Ala Glu Glu Asn Phe Leu Ile Leu Thr Glu 275 280 285 atg gca acc aat cat gta cag gtt ctt gta gaa ttc aca aaa aag cta 1268 Met Ala Thr Asn His Val Gln Val Leu Val Glu Phe Thr Lys Lys Leu 290 295 300 305 cca gga ttt cag act ttg gac cat gaa gac cag att gct ttg ctg aaa 1316 Pro Gly Phe Gln Thr Leu Asp His Glu Asp Gln Ile Ala Leu Leu Lys 310 315 320 ggg tct gcg gtt gaa gct atg ttc ctt cgt tca gct gag att ttc aat 1364 Gly Ser Ala Val Glu Ala Met Phe Leu Arg Ser Ala Glu Ile Phe Asn 325 330 335 aag aaa ctt ccg tct ggg cat tct gac cta ttg gaa gaa aga att cga 1412 Lys Lys Leu Pro Ser Gly His Ser Asp Leu Leu Glu Glu Arg Ile Arg 340 345 350 aat agt ggt atc tct gat gaa tat ata aca cct atg ttt agt ttt tat 1460 Asn Ser Gly Ile Ser Asp Glu Tyr Ile Thr Pro Met Phe Ser Phe Tyr 355 360 365 aaa agt att ggg gaa ctg aaa atg act caa gag gag tat gct ctg ctt 1508 Lys Ser Ile Gly Glu Leu Lys Met Thr Gln Glu Glu Tyr Ala Leu Leu 370 375 380 385 aca gca att gtt atc ctg tct cca gat aga caa tac ata aag gat aga 1556 Thr Ala Ile Val Ile Leu Ser Pro Asp Arg Gln Tyr Ile Lys Asp Arg 390 395 400 gag gca gta gag aag ctt cag gag cca ctt ctt gat gtg cta caa aag 1604 Glu Ala Val Glu Lys Leu Gln Glu Pro Leu Leu Asp Val Leu Gln Lys 405 410 415 ttg tgt aag att cac cag cct gaa aat cct caa cac ttt gcc tgt ctc 1652 Leu Cys Lys Ile His Gln Pro Glu Asn Pro Gln His Phe Ala Cys Leu 420 425 430 ctg ggt cgc ctg act gaa tta cgg aca ttc aat cat cac cac gct gag 1700 Leu Gly Arg Leu Thr Glu Leu Arg Thr Phe Asn His His His Ala Glu 435 440 445 atg ctg atg tca tgg aga gta aac gac cac aag ttt acc cca ctt ctc 1748 Met Leu Met Ser Trp Arg Val Asn Asp His Lys Phe Thr Pro Leu Leu 450 455 460 465 tgt gaa atc tgg gac gtg cag tga tggggattac aggggagggg tctagctcct 1802 Cys Glu Ile Trp Asp Val Gln 470 ttttctctct catattaatc tgatgtataa ctttccttta tttcacttgt acccagtttc 1862 actcaagaaa tcttgatgaa tatttatgtt gtaattacat gtgtaacttc cacaactgta 1922 aatattgggc tagatagaac aactttctct acattgtgtt ttaaaaggct ccagggaatc 1982 ctgcattcta attggcaagc cctgtttgcc taattaaatt gattgttact tcaattctat 2042 ctgttgaact agggaaaatc tcattttgct catcttacca tattgcatat attttattaa 2102 agagttgtat tcaatcttgg caataaagca aacataatgg caacagaaaa aaaaaaaaaa 2162 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 2218 4 27 DNA Artificial Sequence PCR Primer 4 tcagtgtaaa tctaagcgac tgagaaa 27 5 23 DNA Artificial Sequence PCR Primer 5 gcaagtcacg accttcactg tct 23 6 27 DNA Artificial Sequence PCR Probe 6 agcagcatgc agatcagacc gtgaatg 27 7 19 DNA Artificial Sequence PCR Primer 7 gaaggtgaag gtcggagtc 19 8 20 DNA Artificial Sequence PCR Primer 8 gaagatggtg atgggatttc 20 9 20 DNA Artificial Sequence PCR Probe 9 caagcttccc gttctcagcc 20 10 91000 DNA Homo sapiens misc_feature 73772-74071 n = A,T,C or G 10 tgatatgggt ggctgggcag agggaagggt caaaaaggct cccagcttct agttcagtcc 60 cggggaagtg atagagctat tcaaagaagt tgtagtgaga gagagggaag atgacagttt 120 ggttttagtt gatgtgtaca gaaatacggg tgcccaggag cccacaaaac ggccagagga 180 gaaatgcttt caaaggcaag ctgcagggct ccttggtttt gtcacattcc tcattctggg 240 gctttgcggt tttgtcttgg gaatctcgag gctctcccaa ggttcctttc tatgtttata 300 tcatttagca gggaaggatt gttaatgact aatctgtgtc catgaggcac agagccaagg 360 aagagatgct gctgctagcc cagaaggccg cctgtgatca tgcacagtac actggaactc 420 tctcctcctc ctcacctcat tgtctccccg acttatccta atgcgaaatt ggattctgag 480 catttgtagc aaaatcgctg ggatctggag aggaagactc agtccagaat cctcccaggg 540 ccttgaaagt ccatctctga cccaaaacaa tccaagtaag tacctaattc ctttgggagt 600 gggttgtgta tctcacagca acagagaaaa aatagtcact taaaagtttc tctttgacat 660 ctgtaatgta tgtcaataaa tgaattctaa gttagtagag tttgatgtaa agtcctgaaa 720 attaaaaaag agagaaacta aaaaacaaaa agaagcagaa gcaaaagtta atgagtctta 780 acagttgctt acctattgaa aacttattta gaaatactct tttaacattg tggtcacctg 840 agtaaatcac tggagatagt gcatttcaga aatgtctccg ttctgattcc ataaacaatt 900 tgacttgtat agtgtgctat attttggtga tttatcaaat cttgatgtga gtttgggagt 960 attgctaatg tcagatgact tgggaactaa gaataagaca tttaacctat gcttaattga 1020 aatgaaattt ttccctagaa gaagagtagg tggaaaaagt cttctttctt gacttcagtt 1080 gtaaactctt ctattgcttt ccattttgaa tattaatatg acaggaaata tcagatggaa 1140 atatttttaa aagatagaaa tgtgagtatg acgaagaact ttagtaataa aattgtccaa 1200 ggactaaatt tatagataag atacctcttt gtctccttat tgacagagtg aatggggcaa 1260 ctgtggagcc tgacttactt cttttaattg ggtttttatt cagaagggag gggcaggagg 1320 gaatgacaag tgactcacct tgaattcttc ctctaagaaa ctcacacctg agctttgagc 1380 tataaagaaa tctgatgctg tttctggtgc tgtcttagaa tcacttcagg agtattgaca 1440 agaggggtag gaacccttag aaataatatt agtgataaat aagaaggcag gaagaaactt 1500 ttggaggtga tggataggtt tatggtatag attgtggtga tgatttaatg agtgtatgcc 1560 tatccccaga ctcatcaaag tgtatacatg gaatatgtaa agcttttata tgtcagtcac 1620 acctcagtaa agtggtttac ctatctatct atctatctat ctatctatct atctaaattt 1680 ttttttctgt tcctaaaaaa aggaagggag aagagaggaa aagatgttca gggagctacc 1740 attttgtttc tagctgtgat tttataaaat gatagacact tttatctttg tgttacgttc 1800 ctacccccag tcctccaaat tatggatctg tgccatttgt accgtggact tttctgtttt 1860 ctgaggatgt tgcaacaaat actgatgcaa ctcctggtta actgataaag tactggccag 1920 ggacaaagct ctcttgtcct gagacccttc ctcaagattt gcagcaattt cccaccacgt 1980 acctctgccc tctcctcaca gctggagagg gaaagtcatg gaatccttgt ccttcctctt 2040 gtttccacct cttcaagatt gggccaattg caatggaata tccattggtt gtgaggcctt 2100 tgtactctgc aaggaaaaga aaagaaatgt gtgtatgtat gagtgtgtga tggagctaac 2160 ttttctacaa tgtctactaa catgtcctag cctttacttc attcgcctgt ttccttctca 2220 caaaaaccct gtatgggagt ttttctttac tttttattat tatttttttg agacaaagtc 2280 tcgctctgtc tcccaggctg gagtgcagtg gcgctatatc ggctcactgc agcctccacc 2340 tcccgggttc aagcgattct cctgcctcag cctcctgagt agctggtact acaggcgtgc 2400 accaccatgc cactattttt tgtattttta gtagagacgg ggtttcacta tgttggccag 2460 actggtctcg aactcttgac ctcaggtgat ccgcccgcct cggcttccca gagtgctagg 2520 attacaggcg tgagccactg cgcccagcca ggagtttttc ttatactcat tttacagatg 2580 agaaaactga gactcaaaaa atacaagtga cccgtccaca ggcagatagt taggaagtag 2640 cgggacctga acttgagggc gggtctttct gactccaaag cctcttcctg gctactctga 2700 tattggctat tggcggaggc tgggaaaact tgaaatgggg aatgatcggg gagcggcgag 2760 gggggaccag ccgttaagca ttccagcctg acaggggtga tttgttaaac ccaggaacta 2820 gttagacgtt tcctgaaacc tcctgcatag ggcattttcg agagattgca ccatcagtag 2880 ggaaactgga aagtttggag cactatttgt tttaacccag agaccagtct aagatagctg 2940 ggtgaatatt tccgttatca gcgttttaat taaagaccac aatgaactct ccatgctgct 3000 gtgattaggg acatcttaga gaggagctca gaactgctgt ggctcacagt ctctggatta 3060 tgaatgccat gagaatgcat atcatgatgt gatagatttt caaaggctca cattgtcatt 3120 aggtgggtgc ataaaggaca gaaaaggttc tttacttcat agaaagagat ttcccttccc 3180 tttgctctca gtctcatttt tatcttagtc tctagcaaca cagattataa aggggaataa 3240 aagaaaatag gaacaagcta gatacagtga tgagtgcctg tgatcccagc agctcgggcg 3300 gctgaggtgg aaggattgct tgagctcagg agttcaaggc tgcagtgagc tatgataatg 3360 ccactatact ctagcctggg taacatagag agacaccacc tcaaaatttt ttttaaaaaa 3420 taagaaatag gaacaatttt cctcccaggt atttttttat tgtcacagta gtttctacca 3480 gtgaagatgt agctaaaagg catcaccata cgagtgtcct tagtcaccaa gctgtatgag 3540 gaggcaaaaa tcattattac atacaaatat tcatccttct gtcttttctc tcccttctct 3600 caccctgctc tggaatcccc taaccccttt gctcttcaca gtgaattatt ttctccctca 3660 gtttgcccat ttccaatttt ggccataaaa gaacatctac tattcctact ctctctgctt 3720 tgtaaattgg tgataagatg acactaatat aatgcaactt gattgcaaag taaaattgca 3780 aatattgcac aggaaaaaaa ataagattta ttctttgtta ggattaagaa atataacagg 3840 gttctggtga ttaaaataaa taagtataaa tataaaataa atttttatta attttcattt 3900 tttaatataa aatcctaaca aaactttgag attttgcttc ccattttaaa gagatttaga 3960 tgacctttcc caggtctcaa tgaacctgag acaacctggg tctattacac tttaattaat 4020 gttattaatg taaagaaaag gagaaatggt atggctttgc attttcaagt cctgtagata 4080 tatttaacac agttatctgg aaaacatggt aaataatgta gttgtgcatg gagaagatag 4140 tcctgctgtg ggatgttgct gattagagaa aacaaaaaca ctgctctggg ctgtgactgt 4200 tcctaagcag agagctattt gctgccatgt cctagagagt tatggaagct gccagtgtag 4260 acgacaagtt gataccagcc tggggcttaa ggagcaaaag ctggttgcca tccagaattt 4320 gcagggcaga gacaggacaa agaaatgtca ttccaaagtg gtcatgtaaa agattcttgc 4380 tctcctggag tcccgactct gaatctgagg ctatagctgg gcttcagggg gccagggtca 4440 cttaagacta catttaaaaa tctatgcatt ttacagagaa tttttagggc agtgaattca 4500 ttctatatga tattaccaat ggtgggtaca tgttattata cacttgtcaa gacccataga 4560 atacacaaca ccaagattga gccctaatgt aaactatagt ccttcgctga taaagatgtg 4620 tcagcatagg ttcatcgatt gtaaaaaaat gtgccactct ggtgctggtt gttgaaagta 4680 aaggaggctg tgcatgtgtg tggggggtgg gggagaagac gtatatggag gctctccata 4740 cattttgttc aattttatag tgaacataaa atttctccta aacatttatt aatttagaaa 4800 aatatctgta ttttgtgtat gtctttccta gagagaggtt tataattttt gttgaagctc 4860 aaaaatgctg cacaatccta aaattgtttt aagaaaatga agcaaaccct tcttttctgg 4920 agaatctaaa tatacctgct tatagcatct gattatataa ttggcaaagt tgaggacagg 4980 aaatgatacc ctaggagact gacatatctt aatgtagtca ttgtgttaaa agtttcaaca 5040 ttatgaataa gtctaaaata ttttttgacc ttcccctagc cctattctcc tctacttcac 5100 tgtttcaatg ttaatacata tcaaaacact ttaatcttgt ttaaccactt cctaatagtt 5160 gatactatag agcacagttc atttagccat tccactgttg atggtttttt aggtggtttg 5220 ccctctttct gcattagagg caatgctaca atgaacatcc tggtgcatac cagcaaagat 5280 ttctccagga aagataccta ggaattaaaa ttgcacttta ttttaagata cactgctaaa 5340 gagccctcca aaaagactaa aacctatgcc cccactagca atatgtgaga atgcatattt 5400 tcatacactt tgaatattat caaagttaaa attttttgat aatctcatag gtgaaaatgg 5460 tatttagtta tagtaatatc cattttccta attattatta gtttcagata ttttctgtgt 5520 ttactggcca cttgtgtttc cactttggtg aatttccttt ttgtatcctt tgccaatttc 5580 tctcttgggt tgtttatcct ttctgttttt atttatagga atcgtttata tattgtttgt 5640 actaatattt ttggtgttat ttatgttgta aatattttta tccattttta tatatttagc 5700 tttgtttgtg atgccttaac ctctgtctta catttcaacg ttctcaaatg tattcttttc 5760 ctatttactt cttatttaat aaagtccttc tcaatcccca aatccataaa gatcataaat 5820 atattttctc ccaataattt tataattttc agtctttaat ccatctgaga tgtattttcc 5880 aagaatgatt cataaagatg tctttatttt taccctcaaa gcataggcaa ttttcttgcc 5940 acaatttctt gaataggtca tccttcttcc actgatttga gctgagtcca ctagctacag 6000 catagacaac tatttagggt tgctcagaac acagcatgct attttccctt cctatagcca 6060 tccctgttgc gtgtcatttt cctagcctaa tgcctattcc ttcaagacta agttcaagaa 6120 tcaaccctcc acgctcagaa agatttccct cattctcctc actataccct tgtttaaaac 6180 tcatcaatta cttccattta gaagaaaatc caaactcctt accatggcca acaggccata 6240 caagagctgc ctactctcag acctcatttc ctatagcccc tccttggctc actcatttcc 6300 agcctcactg ggcttgtgtt gttcttccaa cactccaagc ccaggccctt tgcttttgct 6360 gatcactctg cctgaactta ttttcccttt gtatgactta ctcattcttg tccttcacat 6420 ctcagttcga atgtccactt ttcagagagc tcttctctga ttgccctctc caaaactgtg 6480 ttcaactaca acactctata cttctctttt actttcttat tatcctcatc agaaatccta 6540 ttatttgttg acttgtgtat tattcactta tccctctaga atgtaagctc tttgtcttac 6600 cactgtattt ctaaaaccta gaccgaaact tggtatgtag tagagaatta atgaatattg 6660 gatgaatgaa tgtccctgct tgtgctcctg atggcacctg tgcttagctc tgttgtagca 6720 atcctcacac tactaccatt gggtttttct tgcctgactt ttccactaag ctttctgaaa 6780 aatcattatt ttttactatg tctagtgtct gtcaataaat gtgcactgac aaaataaatg 6840 agtgaatgaa tgggaaattg atgttgagat gccgttttta aagagtaagt accttatcac 6900 acatagctaa agagaccaac tgacacatcc accgtactag tgggaaatct tagatctata 6960 agtttagaaa catttctatc ttccaagtta ccacaggaga cagtgttact gattgtttca 7020 tgattataaa caggaccccc atcttcccac ctcctatggg aattttctta ttgttcttcc 7080 agtttccact aaccagtctc ctcactgccc ttctagcctc agatcatccc ccaatcccaa 7140 agcccaggcc atatatgtta gcggtttttt tagacagcat ctcacttcca ggtactgatt 7200 tctataccag tcatttatgt ctttgtaaca accatcccaa agcttagtgg cataaaacag 7260 gaatgatcca ttaattctca tagttctgtg gggcggctga gcagcttctc tgctggttat 7320 gcctgggttc actcatgttg ctcattcaga tgtgagttca actggaatga aaggactgag 7380 atggcctcac tcatgtctgg ggtcttgatg tggccgtcag ctgggatagt tggaagagct 7440 ggacttcttt ctccatgaga tttttaaatc tgagatatct gcaaacagta tggcaatctc 7500 agggcagcat tctttcttag atatggaaaa cagaagctac aaggtctctt aaagcctggc 7560 ttagaagtcc ccaacctcac ttccaccaca ttctgttggt caacgaattt attagaccag 7620 ctcaggtcca agggctagcg aaacagactt tgtgcattga tgagaggtgt ggcaaactca 7680 cattgtaaga gcgtgtggaa aggaagcaat tgtggtggcc atctttgcaa acaatttcca 7740 tgagtaagag gcataatatt tcaaagatcc aattaagtat ttgaaaaaaa aaatgacaag 7800 ttagtcacgg agaaagcatt caacaaatag cttcttgacc attttgacta atgtacataa 7860 tgcgaaaaaa gaaaaaacaa actttcctga ttatgaagag tgaacattaa gaagaccaaa 7920 caaccacaac aacaaaacta ttcctggtag ttctgatgga agagaattaa caacgggaaa 7980 ttcgcgagat taattaaaac tatttccggc ccagcatggt ggctcactcc tgtaatccca 8040 gcactttggg aggctgaggt gggcagattg cttgagctca ggagttctag accagcctgg 8100 gcaacatggt gaaatcctgt ctctacaaaa aattaaaaaa attagctggg gctgggcacg 8160 gtggctcatg cctgaatccc agcactttgg gaggccgaag ctggcagatc atgaggtcag 8220 gagatcgaga ccatcctggc taacacggtg aaaccccatc tctactaaaa atacaaaaaa 8280 ttagctgggc atggtggcgg gtgcctgtag tcccagctac tcgggaggct gaggcaggag 8340 aatggtgtga acccaggagg cggagcttgc agtgagccga gatggcacca ctgcactcca 8400 gcctgggtga cagagcgaga ctccatctca aaaaaataaa aataaaaata aataaataaa 8460 taaataaaat tagctgggca agtggtgcat gcctgtagtc ccagctactt gggaggctga 8520 ggtgtgagga ttgcttgagc ctgggaggtc aagactgcag cgagctaaga tcgcaccact 8580 gcattccagc ccaggtgaaa gagagagacc ctgtctcaaa acaaacaaac aaacaacaaa 8640 aaccctgaaa caaaacaaaa caatttcctg gggtggtcaa ctcatgtata aaatggttcc 8700 catggggaga ctggaaagcc tatgcttaag agatttccaa taaaaatggt taccattcac 8760 taattagtct acattgtggg gaagcagact tagttgttag atttctttat gtgacaaaaa 8820 aaaaatcctt cttcggctct tgtgtgtatt gtgttgtatg cctctattca ctaatttctg 8880 tcctatggcc taagcaagac tgattctaaa ggtggttatc tagtaaggta tattattcat 8940 tcactcattt atgtattagt tactttattt attcatgcaa ccatttgttc atttactcaa 9000 ttaacaagaa tttattgtgc atctaagatg tacctggctg gcattgtgca aggcattgga 9060 gatccaatgt aatctctcct agagctgctg gtttaatggg agacacagaa aaccaaatcc 9120 acaaaaacaa aggttgaatt acaatggtaa caccactgca aagaagaagt acccagggct 9180 atgaacgtct cctggagttt gtgctaatca aggagttgag aagcagggct aggcctagag 9240 ctaggcaagt gaggcactga gggtgcaaaa tttaagaaga cattcacttc caggttcctg 9300 gtcctggaag cctcaactcc aggtgcttct tctacatcta gttggaaata aaggtcagct 9360 gttgtttagg caccattgtg taagctccag agagcagaga cgctttgaaa gctgtggata 9420 tctagttgat gcaggtattt tattatttat ttatttattt gagacagagt gtcactctgt 9480 cacaagactg gagtgtagtg gcgcaatctc ggctcactgc aacctctgct tcctgggttc 9540 aagcaattct tctgcctcag cctcccaagt agctgggact acaggcgtgt gccaccgtgc 9600 ccagctaatt tttgtatttt tagtagagat ggagtttcac catgttggcc aggctggtct 9660 caaatccctg acctcaggtg atccacccac ctcagcctcc caaagtgctg ggattacagg 9720 catgagccac tgtgcctggc ctgtcatagt tcttgacaaa caagtcatat ttcttgctgc 9780 taatgggcac aaacaaaact gaggttgatt agtttgaata tcagtataag ttgatcatag 9840 atatgatcca gagaatgcta ttattaaaaa tttaattaaa atgttaattt aaatattaaa 9900 ttttatttta atctatcact aaaaatctat gtagtggtta ctaagatagt aataaaaatt 9960 gtttggttct tatagagcac tgtcttcaga atgatgtggt ccaatggtca tattggctcg 10020 cagccatcct tgttggccat atgttgtcct tgacgtgtct ctggattttc ggaagtatct 10080 atttattcta aatctgtatt tctgtcctgg aaagaacttc agtttgagtg ctgcttgaag 10140 accaaatact tggaaaaata aaacgttttt tcttttccta gagctctttt ctctttttcc 10200 ttaagaatat atttccaggt caggcgcggt ggctcacgcc tgtaatccca gcactttggg 10260 aggccaaggc gggtggatca cttgaggtca ggagtttgag accagtctgg ccaacatggt 10320 gaaaccctgt ctctactgaa aatacaaaaa ttagctgggt gtggtggtgc atgcctgtaa 10380 tcccaactac tcgggaggct gaggcaggag aatcacttga acccgggaga tgaagattgc 10440 agtgaactga ggtcgtgcca ctgcactccg acctgggcaa cagagcaaga ctctgtctca 10500 gaaaaaaaaa aaaaaaaaaa agaatgtatt tccagccttt gggcagaatc taaggcttcc 10560 tctgagcata aaatgatgat ggttagaaaa ccttcatgtt ttttcctgcc ctttaaaaat 10620 ggattacata gatatccaac aaagtgtata ctttctatca acaactgagt tcatttgaga 10680 caatgtcacc aaggatgatg tcttgatatg tagagttcat agtgaactag tgcatgtaat 10740 cggtttttat cttcatcctc caagcactaa gattcctcta aatcagagtc aaagttattt 10800 aattgcagca gtgtctttcc aaggtctccg attgcaaatg gtttatacta agtcagagga 10860 gctccatacg ccatgaactt cccatgacac acggtatcta aataagtaaa tattgataac 10920 cagaaagtgg gagtccaagc taccagtaac tatgtgtctt atcacaataa aaacaaacag 10980 tgcataccat ttctgaaata attcctgggt ccttaaatga agacttaggt tacagcattt 11040 tagaaataag caaaaagtaa aataaacatt ttgtccctcc tttcaccaat acccaaggtt 11100 taaactttgt atcatggggc tgctgcttct ccttcctcct cctttttgta tgctgtacac 11160 aagttccttc cttcccatct ctactgacgc cacgctgcta tgcactctgg tcaccacaca 11220 acaacttcct gaatgtttct ccttctttta ctgcttcccc tccccagccc accttctcac 11280 tagagtaatc atatctactg attttattgt gccactcaaa atttccagta gttccccatt 11340 gaccacagtt aggttcccca aacattttat ggggaggaat tactttttcc tcaaagcttt 11400 aagaatggtg tgtagcaagg gcaaggatgt gtgagaaaga gagaagccct gtatgaactt 11460 gtgtattttt acataggaag ctttccatat ataatttcat tcaatctttt tcattgtgaa 11520 atattttcaa catgcaacaa aatagagtaa tataatcaac ttccttgtat ttattgctca 11580 acttagaagg agagcattac aggtattata gaaactccct atgtacctgc ccagtcacat 11640 tctgcagctc tattctgaat atggggctca ttactttcat tcatgtcttt acagttaagg 11700 caaggatcac aagctttttc tacaaaggat agactgaata ttttagactt tgcagaccat 11760 atggactgta ttgcaaccac agtacttaac tctgtactgt ggctcagaaa ggagccatag 11820 agagcaagta aacaaaaggg caaatctggt ccttggcagg ctgggttggg tctgagagtc 11880 atagtttgct gatctctgca gtagagtaac agtaataata tgtagcactg ctttgcatat 11940 ctttatatca aactgtatgt attattctgc aacctgattc ttttgctcaa tatgatgttt 12000 ccccatttat tcatatgaat acatctagtt ctagttcatt ttattttatt ttatttttta 12060 attttttttg agatggagtg ctctgttgca caggctggag tacagtggtg cgatctcggc 12120 tcactgcaac ctccacttcc tgggttcaag tgattctcct gcctcagcct cccgagtagc 12180 tgggattaca ggcatgtgtc accacccccg gctaattttt tgtatttgta gtagagaagg 12240 gatttcacca tgttggccag gctggtctcg aactcctgac ctcagctgat ctgcccacct 12300 cggcctccca aagtgctggg attacaggtg tgagccacta tgcccagccg ctagttcatt 12360 ttatagtcat ttattttagt tcagattata attattttaa ttgctgtata gtattcacct 12420 gtgaatgtgc aatttattta ttgtatccat tctctttttg atggacatta tgctgtttcc 12480 aatttcttac cattggaaac aatgtggtga actttttttg tacatgtcac tgtatacata 12540 tatttgaaaa cctatgaaag attcactttt taagctttac ggcaacccta ttatttcgat 12600 ttaaatcccc aatcagggga gtgtctgggt ggtagtgtca ggaggaggtg ctgagtgagt 12660 agttccaggt catgcgtgaa gctacggaat gtcagaactg gcactccacc cttaatctcc 12720 tcatgtctca tccccagtga tatttatact agaccaaagc tattatttct gcatatgtaa 12780 aacattgcca gctatgcaaa gggaatatga aataaatata agcaatgatt cctctcttct 12840 agaagtttac gatctggtga tatagtaaag acaaaacaga tgaaatcaaa gcattaatta 12900 ctatacaagg aggagggaga gaaaaatgac aaggaagcat caatcaacaa actggaacat 12960 ataggggaaa aatctataag actgagaggt gcaagagcac atttgagcaa atggagacat 13020 gctgtgccct tgaaaggcaa aattaaatat cagaaatgag cagattttct ccctaaatat 13080 ctacgtttgg caccgtctca aactcccaaa gaaacttttc tttttaagag tgtgataaaa 13140 gatatgataa ttttatcaca gaaaacaagc aagggtatct taggaaatgt tgaaagagaa 13200 tgatcgataa taagaaagga tgatattgat acaagaagta actgcacgat caatggaaaa 13260 ctaaagtctt aaagaagcac tatgaaccaa tgggtgaggg aaaatgtact caagaagcaa 13320 ttctgaaaaa aaaaaatcca ccttgtacca taaactaaaa ataagaagca ataaaagtaa 13380 cctttaaaaa atcagatata tttaactaca tgcaatgcaa aactttttag gtaaaggtaa 13440 tttaaaggca tgtagtaaat ataatagaat gctaatacat tcaacgtatg aagaactctg 13500 ccaagtcaat aaaaaaagga acacttcagt agaaaaatgg caaaggaagt aaacccaaaa 13560 tggaaactgt cacacatagc caatatagat gtaaagaaat gttttttaaa ttactgctaa 13620 tcagagtaat ttaatacaac aagataggat ttgtcaccta gtaaattaga aaaaattgtt 13680 caaagtgaga gtgggtgatg ctaataaagg tatgatgaga aaagcagtct tactgctggt 13740 gggagtataa atggaaaaaa atgatagaac acaatttgac attttttatg ctttgctgta 13800 ttgcctcaat tttctgcaat gaccctttta ttttaaaaat actttagctt aaaaataaat 13860 tttaataact tttcagaaaa acagtggatg aaggaatgct taaagttggg gttttattaa 13920 gttaacaaac tattaattga gactgattag gctaacagag ggggcagctg gctaatctat 13980 cctgcatcca ccctatccct cttccaggac aggggtgggg aatgggggag tcagggggaa 14040 ggtagggaag tgatcatgtt caaatcagat ctttttcttt tctttacctg ctctgattct 14100 tcttcttctt cttttttttt tttttttttt gagacgagtc tcactctgtt gcccaggctg 14160 gagtgctgga gtgctggagt gcagtggcgt gatctcgagt tactgcaacc tctgcctccc 14220 aggttcaaat gattctcctg cctcagcctc ctgagtagct tggattacag gcacctgcca 14280 ccacgcctgg ctaattttta tatttttagt aaagacgggg tttcaccatg ttgaccaggc 14340 tggtctcaaa ctcctgacct caagtgatcc gcctgtctca gcctcccaaa gtgttgggat 14400 tacaggtgtg aaccactatg tctggccttt agctgcattt ctaaggagat ttgcttcctc 14460 tacttcatgt gttctgccag ctgttgacca tggctatcac ctgacaatgt tgttttcctt 14520 tctttctttt ctttttgaga tggaatctca ctctgtttcc caggttagtg cagtggtgca 14580 attttgggtc actgcaactt ctgcctcccg agttcaagtg attctcctgc ctcagcctct 14640 cgagtagctg ggattacaag cctgtgccac catgcctggt taatttttgt atttttatta 14700 gagatgagat tcctcatgtt ggccaggctg gtcttgaact cccgacctca agtaatccgc 14760 ccacctcagc ctcccaaagt gctgggatta caggcgtgag ccactgcatc cggccataat 14820 gttgtttttc tttctgatct ggtgttctct cttccacaat agactgcttt ctctacttgt 14880 gccttttggc atggaaaatg agctgcccca tttgtagaga cagtctcatg gacaaggcat 14940 agttttgagt aaatgggtac tcataatagc tagtcataaa gtaaatcctg aaattttaaa 15000 aaaatgttta aaatcttaaa gtcattagaa attaaaaaaa ttcaaattta tatacatatt 15060 ttagtggcag aaaatatgat aaaatgacca ataacagact ctcacgtgta gacatatgct 15120 accttaggat aaaattctgc catgaagtgg aatggaatga aatgtaaact tgaggaagaa 15180 caggaatgat taaaaagttt ccagctgtta aagaataact tcatctattt cttaaaagga 15240 taagggaggg taaaaagttg ctgtgatatt tagatttaat cgaattattt aaaagtaatg 15300 caacaacatg taaaaatgtc aatttttata atacatcaga aattatatta cttgcaacta 15360 ttttattttt tagtgaaaga cattagatgt caacccaaaa agtgtgttga aggagcatat 15420 tgtttttcaa gggtctttca aaggttaaaa ttctcaacct ataacatgac tccttttatg 15480 gaagggagga tttgaaggaa aaaagactat agagctaaag tcaagggtct cctgggagtt 15540 cttcttttgc agttccttcc ccaaaatgaa agtgaagagt attagagaaa gaagagcacc 15600 aaagtaaccg taattagaca aataatgtat atggaaactc ttggcaaata ggctaaagtt 15660 gctaatccca gtaagaagct gttaagaggt aaagaaaatc tgactattat gagatgtatt 15720 taattaagtg ggtttatatc cccaaggtaa tctttacatt tattcatagt ttacctctaa 15780 cactactttg attgtttgaa tcccttctgt tctttgtttc aggcagggat taacatactt 15840 tttgtgtaaa aggctagata gtaaatattt ttgggtttct gaaccgtgca gtcttgtgtt 15900 gcaactactg tactcaactc tgctactgca gggagaaaag agccatagat attccataaa 15960 caagtggctg tggctgagta ccattaaact atttataggc actgaatatg aatttcatat 16020 aatttccatg tgtcataaaa tattcttctt ttgttttttt taaccatctg aaaatgtaaa 16080 accattctaa gctcacaggc tacacaaaaa caggcagccg accggatttg gcctgcaggc 16140 aatagtttac caactctgtt tttagataca gtatttgaag gcccttttat actgaagagt 16200 gagaacaata ttctggaagg atacacaaag aattagcaat tgtaattgta tttaggtaag 16260 gaaaatgggt gtctaagctg ggaagacttt ttttcagtat actgtttgaa ttgtttacca 16320 tgtgcttgtg ttactatttc aaaaatttag tttctttttt tttaaagccc attcgggaat 16380 tgtaaaccca atgaatctga cttgagtgga ccctgctaac tatctggatt ctagagagtt 16440 ctgctgttag ctaatcaggg gcattcacat ggtaggtctc caatcaggga taataactta 16500 aatgatgcag aattagggga actggacttt tcatttggga gagcaagtga taaattatgt 16560 cctaagcttt tgtgccaata gggcttataa catttctcca ggaagagcaa gtcaaatgtc 16620 atagaatttg gcagaactaa gcaagaaaaa caataaatgc agataattta atagagccaa 16680 catttcttga gggctgactg tatgccaggc ccttaatttg aaggtgccaa gtttatatga 16740 atgtattgaa gaaaatatat cacaggaatc caaaatgaag atgagtgtgc ctggaaaata 16800 ttaaaaaact gtaatgtgca gtggaagtac gggggagctt gctgaacatg gctcaaggtt 16860 aaaaaagaca gatgtgagac aaggagagca cagagggaaa cctgaagtca gagtcacaga 16920 gctgtaatat cctattttta aagaccagag gaaatgggcc ttaagttaca gagtataagc 16980 ttgggttgaa tattaggaaa actttctgac cgttcacatt gtgggttcct ggaaaatatt 17040 attatgggaa attgtggaat ttcctattcc caagatagga gaaactctag aacaataatt 17100 gcctaaattt tgatggggag tggggtattt aagcccattt attatacaat atacagacac 17160 atatacatat ttagtatcat gattcatata tgtattcata tatacacatt atatatatcc 17220 ttatacaatt gatatgtaca tgctgaaatt tgaatttccc tagtcctcaa ctagcttatg 17280 agtatgaatg aggtcctgtt aataacaaca actcaacatt tttcttttga gactttagct 17340 ctgtcgtcca ggctggaatg cagtggcatg atcagagttc actgcagcct cgacctccca 17400 ggctcaagtg atcctcctgc ttcaacttcc caagtagctt ggctgggact acaggtgtgc 17460 atcaccatgc ccagctaatt tttaaatttt tattagaggt gagttttgtc atgttgccca 17520 ggctggtctc gaactcttgg gcttgagtga ttctcctgcc ttggccttcc aaagtgatgg 17580 gattacaggc agtagccact gtgcctagcc agaaactcaa cttttactgg cttactcttc 17640 aggacctccc gctcctatca gagcctccct ccccgacaac accccagtta tttcaggagt 17700 gccttggggt ctcttcatgt ttataactaa gtctgctctg gctgttcctt ccactccatt 17760 ttgtgtccat ggggcagagc agtgcccaac cagtagtcat gattcctctg ctgggcacct 17820 ctgctgccca gccctcttgg gttcggagtt ccactgtgat atatggcata gttgtttgag 17880 gatgagggtg aggttgaagt gagttctctc tctcttctgc ttgttgcatt ggacggatct 17940 gccttatacc tcaatgtttc cagacttcat ttttatttac aacccttttc ttctcacctc 18000 tgggtggtag gaaggtctcc tgaatctagt atgggttggt caaacacagc tacactgaaa 18060 attacatatg tctctaatgg ctaaggcttt cctttggggt gaagttcatg gtccctagct 18120 ctctatgagt ttagaatgga ttttctccca ccaagccgct tatattaatg gatagtagag 18180 tttggcagta gggcttggtg gaatttgatg ggctatgccg tgggtttacc agatggaccc 18240 agatatgcct ggaacccaca gtggacacag tttgtgtctc ttctcagaag actctccctc 18300 attcttgctt cttcatgtgg ttcttctctg ttgtcttccc atcaatgcag attgctctag 18360 tatcgcccat ctttaaaaat tgcccttgaa actacatcct ccctccgcta cttctctatc 18420 tcttgactct ctatctcttc acatccaaac ctctctatta cagcatcctg ttatttgcct 18480 tcataaacag ctttcatgac aatatataat tatctatgta tttgggtgat tatttaatat 18540 ttattcctaa ttaagcttga tatcatggat catgaatgtt tattttcctt ttgccatttt 18600 atcttcaaat atctagcaca gagtttagtg cagaatatgt gctcactgaa tatttgttgc 18660 aagcattgaa taaatgaata aatgaaggaa acaggtttct ggtggacgac tttcccagca 18720 gccaagtgca cctgaaactc acttctgttt gagctaggac atttggtgag tgagagagaa 18780 agagagatac gaataatggg gaaatcctaa tgaaatacat acttatccaa aaaacccact 18840 atatcctctt cctgggcagc aaatatagac agctatttat ttgtaaataa agtcatacct 18900 tggtggagac agaaaaaaaa ggtttgaaaa tcatcctcag aaaacagctt gggaaaagag 18960 attctgacag cacacattat cacttttcat tatttaacct ttcaaattac ttctttaggg 19020 aggtagaaga catcgtagaa ggagtgaaag aagaaaagaa gacttagaaa catagctcaa 19080 agtgaacact gcttctctta gtttcctgga tttcttctgg acatttcctc aagatgaaac 19140 ttcagacact ttggtgagat ttaccattat ttttcttctt actatatctt tttaaaaagc 19200 cccggatcaa tacatgacca aggtagatca tgacagtgaa aatgaaagcc cctgaaattt 19260 cctctaaaag gcagatacaa ggaatatctc tgattttctg atctgctgac ataatgccta 19320 aatagtcaaa aaaaattcca gtaggaagtg gctttaaatc actattaatg tttataaaaa 19380 cctattcaac tttagaataa agtttaagat aataaatcat ggattcatct cattttaaat 19440 caaggccttt tatatttgtt tttttttcta ataaagacat agtcaaaatg tcatcatatc 19500 ttgcccaaat ctcccaaggg ctgaaatttg aatgaaaaca tacggtgttt ttttcagata 19560 aattagtatc ttttatgcaa gcacgtatct tgggcatttt taggtccaaa tgaggtataa 19620 aaatgataaa tattgagaaa agatacatac agaaaggcta tagtaaaatg cattcaaagg 19680 gagccatttg tactttcgta aactatctcg ctgtctcctt ccccagctct cctaaccaac 19740 ccttcccgca ttcccacagt cacaaactat ttattttcct ttcaggagtt ttttttgaag 19800 accaccataa agaaagtgca tttcaattga aaaatttgga tgggatcaaa aatgaatctc 19860 attgaacatt cccatttacc taccacagat gaattttctt tttctgaaaa tttatttggt 19920 aagttgtcaa gttcatttga atatcaataa gaacaaacta cgatttggaa atacatgagg 19980 aaatgcctag atgatgagtc cagcccaggg tcaagaacat gtttttgttc cttccaagta 20040 ctcaatcaat gttagctact ctgcaaggtt agaggtagat cggaccttgg attcatggga 20100 cttaatggtc ttcattttta tagatgagga ctgcaaaagc tttgtgaatt gctcaaggcc 20160 gcgaggctag ataattaaca gagcagggag tagaaagcga ttcctcctct aaggctaatt 20220 ccaatattat tatagcaggt actaaactat gaaatcatga ttcttatggc ttaatatgac 20280 acctattaaa ggctgagttt acttaggtta tagctgggca gatttccttt tgcagttcct 20340 ccggtgctgg gggatctatg gcacagtgtc aggattagtc atttaagcag gattgaggac 20400 aaattgtact aagcagtttt cagtagatct gtaactagat gataccataa aatgatatat 20460 tgctccactt tatgcattta tgaaaaacat ctcattataa agagtttcaa ctgcaaacat 20520 taacaaagat ttaaccaagg caatattttg ggtgggacag ggagcctttc tgccagtgtt 20580 aaaataattc ccttctgcat aatagccctg ctggcagcct gggatgctcc cagttttcac 20640 cctccgttaa tatatgggcc atgatatacc ataatagtga aaatgcaaac actagaagtt 20700 cctcaaagtt caatgtatga aggcatgacc gcttatttgg ttgctgatat tcctaggacc 20760 tgttaattaa taatttgttc tgttacctat tagtcattag cactttagtg tgtaatagtt 20820 gattaggcca tttccaaaaa ggggtcatta tttgtaataa ttatttacat gtgtacctgc 20880 ctccatacag tacaaatgca tttaagggca gtgaccatat ttcattaata ttttattatc 20940 cagtgcatat ttagtaggca cttgataaat aagagtccag taataaataa tagaagggaa 21000 aagaaaaaga ggtttatgga aaaaacagat gttgatccgt aactcttaga gagaccaaag 21060 gcggcagtaa ttatttatta aatgccagtt ctcccaagta cttgctgcat cttatttctt 21120 ccagttggct tatcttcatc ctgtagatgc agtcaagggc agtgggaggc cctgcctcca 21180 tgtctagcca cttgaccatt tgcttctggc ctgcctgccc ttccgtggtt cctcagaaca 21240 gagtcaatgt atagacatct taatgatttt ttcttctagc aatcagaact cacagaataa 21300 ttaggactat tggcacatac cacagataaa gaaaaattgc actcaagatt ttttttcaaa 21360 cagaaacttc catttcctgc tgtaaaattt tatgattgta ggttgagatc tctgagctta 21420 ggcttcatca ttactaaacc aatctattca gctccaattt gcatacagag agaaaattgt 21480 agggttttcc ttacaagcta gcccaatttt cagccagcac agaagatttc agggtgattt 21540 cagcagccat aaggtgaaaa agatgccagt aaattcagaa aaagccacga agatgccgtc 21600 aggttagaag catctaaaga tgattgctaa ggaagcactg tgccttctgc ttttatggtg 21660 tggttctctc ctctctctct tttgctggac cacattctta ttccatttcc taacactgca 21720 tttaaggttt ctgagaatat atagcacaag ctgcatgagc tgcaatgcag agctgtaatt 21780 ctttgctgga gcagtttcct caactcttta tctggtcccc tcttctggta aactaatctg 21840 accatgtgtg accattagga ctccttttcc ttttctccac taagctgtag agccacgtct 21900 aagcattatt ttcaaatctc tttcccctag gccctcagtc tcacttcact taacacatct 21960 gagctaggct gggcatagtg gctcacacct gtaattccag cactttggga ggctgaggga 22020 ggtggatcac gaggtcagga gatcaagatc atcctggcca acatggttaa accctgtctc 22080 tactaaaaat acaaaaatta gcttggtgtg gtggtgcatg cctataatcc cagctactca 22140 agagaactga ggcaggagaa tcgcttgaac cagggcagag gttgcagtga gtcgagatcg 22200 cgccactgca ctccagcctg gtgacagagt gaggctccaa ctcaagaaaa acaaacaaaa 22260 aaaaaacata cctgagctaa aggtatgcca aaggggagga cgaagaaaat tttgattcct 22320 ttattcctct tcctcatctt cttgtgtgtc atgatgacat tgatggcttc tatggcccct 22380 tcatgtgcgt agggtggaag gcaccaggga aaagactggc ccatgaatga gtagcccaca 22440 ggcctaactc agtactcctg agtctaagct acgccagggg ttcagaaggg ttaatccatg 22500 ggttcaactg cccagaaagc tggactcttt tccagtggtt aactgtgtat tgaaaacagc 22560 agtcatgttg ttataataaa ggcaaacaag gttaatggaa tagtctatgg atagctgtca 22620 cttcttctct tcaaatatcg gacactgaca atacgctggg ccttggaata aagcagagaa 22680 taggacaggc aagatcctag tagaaagcaa actcacaata aacttgtaag taaatcaata 22740 aacaaaataa tattattttt ataccttatt tggacctaca gatgcccaag gtagtgatac 22800 atgctatgaa ggcaacaaaa tagggtgatg ccctatagcc tggagtgatg ggctgctcta 22860 agtggtatag tcaggaaagg cccctgagga gtccacaatt gagcacagcc tgaaagacaa 22920 gaagccagag agaacattcc gggaggagac aataacaagt gcaacggccc aaagtctgga 22980 atggactcag aaggaccttg tcttctactc atttttattt aggggtggac aactgctttc 23040 ccattaacca gctgtgcagc cttgaacagc tcactcaagt tcagttggta aaagtagatt 23100 atcaggctgg gaattctagg acatttactc aaattaaaca tttgagtcca caaaaatgct 23160 actgttgtcc acgtcaggaa gtggtgactt aatggtactc acttggtaca cggctcaaca 23220 agtcctttta aggtggtcct gggcatggtc cccttcactg caggttatta taaaggtcat 23280 aattcaatga caactttaag gcatttagaa ggtgacctga agacaaatac agacaagcac 23340 tcactactgg gtcctggaga caataatgac cttttcagtg ataacttgca gcttcttcta 23400 gaggtgacca tgggcccctt actcttcaaa ggctttgctt aaggtgaagg ggaaatttct 23460 tcacccctgt aatgtttatt ccaaaaagca gagcccaaat gaggacttgt gtgaaggcag 23520 tttatgtgag acatgatgag ggagtggagg cgagggagtg ggcgatgcaa cagggaagga 23580 gggaaaacca aagaggtcag ttgtcaagat ccctggtgtg ggggaggcca gcttgatgcc 23640 actgggagca gcacagagaa ctgtcagcag aggggtcagg gactggagca tatattaata 23700 ttaactgccc ctcggctccc gttggttgag cgtcacccca gggggcatta attccccgca 23760 cttctggact gtttttgagc tcaggccaag ctgtctcctg tgggattaga gaagttctgg 23820 gcagaaagca taggaacgtg cggcatgctc ttgaaatgac aggtggtcag cctgcagtca 23880 tcatctgagc tcgtgtatga tcagctgcca caggtatggc cgagatcaga gacgggctga 23940 agggatgtga cagctccacc agaaaaagaa gaaaagaaac ttggacagaa tggaatcaga 24000 gagatggggg ttggaatcac ctccctgaag tcctttaatt tattctgttg ccagacatcc 24060 ctgacaaatc cagacggaaa tctagcccgt cggtaactga atccacgtag aggcctgggg 24120 gtggaaggac acagttacca cacagagatt cgtctccttt cctctgggag atcctgcaag 24180 ccaggaagct tggtcaaaga ttaatttagt ggattaaaaa aggcgtctct gatgaatgaa 24240 taaggcaaag ttttatgcaa ttagtgcata ataaaaagta ataatgacaa agtccacagc 24300 tccccaccat ctgacatagt gaataaaaca ggagatggta tattaactaa agtctatatt 24360 tacttttagc aaattatgtt tttaaaactc aaaaattaca aaaatttaga gactaaagag 24420 acaaagcatt ccaacaataa gagaagaagc aagagaaaat acagtcttat cctcccacac 24480 ccaatgttcc tgggcacagt gattcctgat tcacttagca cagtgacact gggaatggaa 24540 ataaggaata aaatagcggg aaatgaaatc tcagaggcaa gtgagattct ctgaacttca 24600 gtttttaccc acataaaatg aggataataa tgtcacactt cacaggttgt tgggaggaag 24660 gaatgggctg tgtgggaaag cgctctgtaa gctgcaaggc atcctaaaat gttacaatta 24720 atccagagtt tgaaatgtag ttgtcatatt ttaaattgtt gaaatgataa acaccattcc 24780 actcattaaa agaaggaagt actgtttatc tcaacattgt ggagtgcaaa gaactttcaa 24840 ggggccagag agaatcaaca atacgtaaat aaggctgggc gcgggggctc atgcctgtaa 24900 tcccagcaca ttggaaggcc aaggcaggcg gatcaccaga ggtcaggagt ttgagaccag 24960 actggctaac gtggtgaaac ctcgtctcta ctaaatatac aaaattagct gggcgtggtg 25020 gcacgtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgaacccag 25080 gaggcggagg ttacaatgag atgggcgaca gagtgagcct cagtctccaa aaaaaaaacc 25140 aaacaaacaa aacaagaaaa aaacaccaaa aaacaaaaac aaacaatatg taaataaatt 25200 tctggttacc tcacctttgc attattacat tgctaacaac agctttggaa tccacgtctt 25260 gtattgcttt tattattgta tctaacagtc tttccggagc cccattctga ttgaggccta 25320 gcaagtgtca ggaggtagat cctatacaat cccattcact tatctttcat ttcctttggc 25380 tatttctcat tgccttatgc ttcacttagc tgtgtatgcc cattttctta gggtgcgctt 25440 tatttcttta tttttatttt tatagagaaa gggccttgct ctgtcaccca ggctggagtg 25500 cagtggcaca atcagagctc actgcagcct tgaactactg ggcccaagca atcctccctc 25560 ctcagcctct tcatgcttta tttgtatcag aatacactca ctttgaaatc tgcttcttcc 25620 ctatgactta tgcttaaatt gaacgaatct tccagatgtt acagagtcag tcagatgttt 25680 taaaatgcaa ttaaaggttt tgagatgcag tttgtagaat atttctgaca tttgatttca 25740 ggtatttttc catctcactg agttcaactc aagtgaacgt agttctgact tcttgggtga 25800 attaaagttc aaagtttcca aaagatagag agataccatt gtctcagata aaatgaagct 25860 ttgtttcagc atacttgaaa tattttgtca ctgctatttc ttctccacag ctcttccata 25920 gctgtaagca gtctctaagg catttaattt actgttactt tggttttttc tttctcaagg 25980 aatctcagtt ctggttctct tgatagtgga gtaaagaaat gggagaagga aagaaatatt 26040 gaatgagcct cttcttatta caaaatagat tactgattct ttcagcaggg gcttgcagag 26100 tttggaaaag gtgtgcaatg agtcagagag ttggccagct ggtaggcgat tccactgcag 26160 tctaggcctt agctctcaac cctgcaggct aagaaacaga ttcttaggga tgccaagtta 26220 aaaaagattg actataaaat catattgcaa atatattaac ttacaacaat aaaaataatt 26280 gtaatgataa cagctagtat ttcttgagtg ctttatagaa attaatgtgt cagatatcaa 26340 gctaagtaca ttttagattt gttaacagat ttcctgggtt ggttccaatc tctgccatct 26400 actaccctgt ggtcttggac aaattcctta actttgcaca agttacttgc cttatttaaa 26460 taaggtgata agaagaatgc ctggcacatc acaagtactc tatttgccac taaaattcac 26520 tctcacaaca cacctataag gtaggtacca ttactgttcc ccttttatgg tgagaaaagg 26580 aatatttagt gatgttaaat aatctgctaa acattccata attagtaagt aggcaactga 26640 aatataatca ggacaaggaa acaaaatcca atgtttaagt ctccaggctt agagaatact 26700 tctattaacc ttgtcaactg ggaaggtctt tcaataagat tatctacctt taaagtctgc 26760 tgaccatgtc tgttctgaag gatagttctc agaaaaaaaa cattgccaac atttactcta 26820 ggacctccct tttttcatgc cttcatagct gatgccccac tcagcgtgag aggtacaaca 26880 ttcacagtgc tcccaaatac tctgccgaac agtgccatca gatagaactt tctgtgacga 26940 aggaaatgtt ctagctctac ttcgtccaat acagtagcca ctattcgcat gtggttattg 27000 aacacttaaa atgtgactgg tgtgactgag gaactgactc taatttaatt taaatcaatt 27060 taatttaaat agcctgatat ggttgggctc tgtgtcccca cccaaatctc atctcaaatt 27120 ataatcccca catgttgagg gagagacttg gtgacaggtg attggatcat gggggtggtt 27180 tccccatgct gttctcatga tagtgagtga gttctcatga gatcttgttg tttgataagt 27240 atctggtgct tctcccttcc tactctctct cctgctgcct tttgaggaca tgctttgctg 27300 ccctttcacc ttctaccacg attataagtt tcctgagact tccccagcca tgcagaactg 27360 tgagtcaatt aaaccttttt tcttcataaa ttgcccagtc ttggatagtt cttcatagca 27420 gtgtgaaaag agactaataa atagccatac atagctagtg gctactgttt tagacagcag 27480 agatattcag gaaaaaatat ttttttaggg aaaaaataat atgagacagt tgatcatctc 27540 taagagaaat tcaaagggat ccaatcagtc ttgttatgag aacattctgt gtgaatttca 27600 ctcagcaaca agggtttcct gggtcagcta aataatcctg ctcatttctt atctagtata 27660 gatgtttctg agggcaaccg acaggtccat ttattttggc cagtgaaaaa cttagtggag 27720 aattcttgaa atggaaaatt ctgaatcatt cgtctagctt catatccttt gggtagcttt 27780 ttcacctatg accctctcag ggaaaggata gctctgggat tgtgaggaag ggagaacgag 27840 gtcagaagta aactgaggac tgaaaatggc tttgcagatg gtgaccccga tcaagcaatt 27900 caaactgaaa tatacccata gaccaccact cttttcctct ttcttggcaa ttagggcaag 27960 tgtaccttag tttttccacc acgggatttt gagtcagaag acctgggtta aagtcccagc 28020 ttcattcctt actaatggga tgactataga caaatcactg aaactcactg agtttaaaaa 28080 taggcttaaa gatagtggaa gttagaaata atacctccct ccaagggtgc aagtgaagat 28140 ctgatgagat aatgcatatg gaaccgcttt ggaaacttgt gaaggctata gaaatgtctt 28200 agcaatgatt attccacttg tatagctcac ctttgccttt caaaattgct ttcatattta 28260 ctatctcatt ttatcttcag aacaggccta aatagggcat gaattattat ctatgtttta 28320 ccatctattg atgtggacaa atggggccca cagaagctga gagactcatc taaggttaca 28380 cagtgcatct cgctacatta tgcctgggct ccaccctgga tcaactgagt ccccttctaa 28440 agccctcatt tctaaaaaac aaccaataga cttaagcttg atgtacaaaa gttatgctga 28500 aggataaaat atagaattat tcctattcaa taagcaaccc aaggtacaac atgagtccaa 28560 agctgagcaa tcataacatc aatgtagtga tgatgggtag ccacctgttg tgctaagcat 28620 tttataaata ctattttagt taataatcac gtataattct ttgaggttgt atagttatca 28680 accccatttt acaaccaagg aaataaaggc agaaaatggt tcaggaattt gtccattgtc 28740 cacagagcta ataagtgcca agacagcctg gatgtattag tttgctgggt ctgctgtaac 28800 aaagtaacat agactgggta gcttaaacca cagaaattat cccatagttc tggaggctgg 28860 aagtccaaaa tcaaggtgtc agcagggtta gctcttcctg aaggctgtgg gtggaggatg 28920 tgctccaggg ctctctcctt ggcccacaga tggccatctc cgtctcctct ctgtgtgttc 28980 acatcaactt ccctctctgt gtttctgtct ctgtaaccaa atttccagtt tttataagga 29040 tacaagtcat attggatctt cattttaatt tgattttctc tgtaatgacc ctatcaacaa 29100 ttaaggtcac attctgaggt actggggttt aaggcttcaa catatataca tcctcatctt 29160 acaatggttc aacctactat gtttagactt tacaatggtg tgaaatagac acttttagca 29220 tagttttcaa taaattgcat gagatattca gttctttatt acaaaaaagg ctttgtgcta 29280 gatgattttg tcttacatgg tggtaggcat gtcttacatg gtggtaggga agacagaatg 29340 agagccaagc aaaaggggaa accccttata aaaccatcag atcttgtgag acttactcac 29400 taccacgaga acagtgtggg ggaaaccaac cccatgattc aattatctcc tacaggatcc 29460 gtcccacaac acgtggggat tatgggagct acaattcaag atgagatttg ggttgggaca 29520 cagccaaacc atatcaagca ttatgcttta aattcttcca ttttaaatca taccattcca 29580 ttttaaatca taccattctg ccaccttttg atggggacaa atgtcaggac ccggagcatg 29640 gcacttgacc tccaagtcca gttttagagc cagtgaacag aaacccaccc tctaaaagtt 29700 cttaaactgg aaaagtactc ccccaaaatg tttatctaag agactggttt ccagcttact 29760 aggcaatttg gcattaagaa ctttcctcat aaacatttac aaatattctg ctccgtaatg 29820 aagttaatca gtaaaccaca caacctctgt cctctctcac tccttaccta gtctcatttt 29880 cagtggctgt gaataagcta agaatggtaa tgcagtttca ggggttagaa aatccaattc 29940 aaattagtcc tcactgcagc tgtacgccgt caggattttt catggaaatg atgagtatga 30000 agcccgcgaa aggtaggaca ctgttcacag gtgctttcag gtcgagctct tgcaactgga 30060 ctgaggaaat ctggggactt tggttggaga tgacagtaac agatgtctgt caaagatagt 30120 ggctcagtca gacccagaag actggaatga gagtttgggt tggggtgatt tttatagggt 30180 tctaaacaga cgaatatctt atggggatga aatccaagga tgtttcccct aatattcttc 30240 atagactctt atgtcaaatt atcactattc ttccaaacca aacattagac aattctaata 30300 ttaatatcaa tggacacttg aaaaaacatt ttaacagaca ttatttaatg cagtgctcct 30360 aaaatggcaa gtctcagttc ccaggggatc acaaaattta tttagtgtct cgtgaccagc 30420 attaaaaact ggattagaat ataatagaat agaaatatta gtgtccctca aggagtaaag 30480 acaagtggta tttcatgaaa ttttgtttca gttacatgtg tataagtatg tacgagggat 30540 atgagataaa atgtatctaa aaccgtaggt tgtggtaaaa aaaaaaatga aaagagtttg 30600 ggaaacacga attaatgaag tagtttcaca gcaggtttct cccaatagtc catgaaaaat 30660 cagataagca cagagttgag ctttaggccc aggctcttgg ttttctctga attaaaacat 30720 cgattataat taaaacacag tataatgtaa atcatatttt aatatttttt aaaacagaaa 30780 ttcatctttt aaatttccag caagaaaaaa ctcgagtcat ttttataata atgataattg 30840 tgaacatgtt tttggtcata gacttggtag cataccattt tgtttaaatg cgatgtttaa 30900 aagtgaccga cactgtattt agaaatggaa atgagttgga aggccgggga agcaaatttt 30960 ctatttactt caatttaacc aacatttaat aaacatttac tatgtgtcag ccattgagtt 31020 agatgccagg aatataaaga aaagaatata gtctaatatt ttccctcaaa gagttcacag 31080 actaatcaca acaatcacaa aagtatacat gtgtatgtgg tgtcaggcac tatgcagttt 31140 ccattctttg tcttagttca tctttattac agcctctcat gcaggcacta cattctttca 31200 gctttgtaaa ggaaggaaca gaagcttaaa gaagttccat gtctttccca ggttcatata 31260 cctaacaagt tgtgggtgaa atgaatccag atttgtctgc tcagagggca ctcttaggta 31320 gtatctcata ccatctcaag ttcagcgtgg aagataagcc tgtaaacaaa agaaatgatt 31380 gtatgtgctt ccaataagtg ccataataga agtgtgtaca gaggtaggag aggtgaattc 31440 cacctggggg aactggggca gacctcccag agtaggtgac cagaagtagg tgatcagctg 31500 acaaggcagg gagaggcact gacatcacaa aggcctgatg ccaggaacag caaacaactt 31560 tgcagtaaag ccctgccatt tagtgaagaa accatggatt aagaaataaa ggaaggcaaa 31620 aaaaaaaagt gacttttcta ggcactgcaa ttcaacagac attcattaaa catttactat 31680 ggtcagcatt gtgccaggaa tgaaaaactg gatataaact ggaccctttc ttaggagcat 31740 cagaaactca ctgtgagatt atgggggagt gccattgttt ttatgcaaac ctgatttccc 31800 atttgggggt acagtggttc tggtcccaaa ctccttagag ctgcactggc taatatgcca 31860 catgtggcta ttcaacttta acttagaatt aaataaaatt taacagtcag actctaagta 31920 tcactagcca cattccaagg gttcaatatc cctatgtggt tagtggctac catattggta 31980 ggtacagaac ccttctatca cctcagaaag ctccctcaga tggcactcct ttagagaagg 32040 tgttgaaacc tggagactgt ctaaccagaa gaattttact ttcttttcag ggcttctcag 32100 actcccctgg agtccatcca tgtgttctca tttaagaact cgtccctaaa cttccatgga 32160 gaaccgtcac aaaggaatcc taattctcat actctttgat aggctaattc caaaataaag 32220 attcctaaat ctaaatttta atatgtattt tcccttttaa attgtatcaa aatagaaata 32280 aatttacctt ttcagaaagc atactctaaa ttaaagtcga actacctcta ctccctccac 32340 atcttatttt ctgccaatgt tgtatttgtt tacttgttta cttatttttg tcctaaccct 32400 tgacatcccc acttcccaaa acacacaaac aagagaaagg ctgggccttc gtatgtgtct 32460 tttcactttc atatgttcag accctaacac agtgactgct aaatagcaag tgtttataac 32520 acacacacac acacacacac acacacacac acacacacag agagagagag agaacatgag 32580 catatgttta taaataataa tttgtaatga aagaactatc ctaatactta tatcatcaag 32640 acagactgtg ttaataatta cttattaact atcaattaac tatgattaaa atcaggattt 32700 tgctaggttc tttaggaaac atgtctttta atgacactca acttaccgaa actgcttctc 32760 agcaaccgta ttttacactc agaactacct tctgagatag caacagtccc atttttatag 32820 gagagattga aataggaaaa gcttagccag ttaagatctc atggtcagtt aactgtggaa 32880 gttgttctag tccaaccatt ttggaatatt tggccatgct ggttctttgg ttgtaaagtt 32940 cctttgtaat tacttctcta tttgtggaga tttaggagac acactatgaa atgctagaag 33000 tgaatgtgct ttgagaatag aaggtttttt ttgttttttg agatggagtc tcgctctgtc 33060 acccaggctg gagtgcaatg acatgatctt ggctcatcac tgcaggctcc acctcccagg 33120 ttcaagtgat tctcctgcct cagcctcccg agtggctggg ttacacgcac ctgccaccac 33180 gtccagctaa tttttgtatc tttagtagag acggggtttc accatgttgg tcaggctggt 33240 cttgaactcc ttacctcagg tgatccactc gcctttgcct cccaaagtgc tgggattaca 33300 ggcattagcc actgcacttg gccaagaata gaagtattat ttaatattat tgtgtacggt 33360 tctcatattt ttactgatgt tctactgttt tagcatgata ttatctatat ctttgtgaaa 33420 gctttccatt ttgatctaga ttattccaga gactgaaaga cagggatatt attatctgac 33480 ttgacttggg actcctttgt atttcttgat tatgaaaata attgttttgt tgaaataatg 33540 cttaaagatt ttagaattgt catatgagga gaaaaaaagt acacatctcc ttggatgcta 33600 cagaatccag attctatcgg tttttgatct tgggcaaatt acttaacctt actagacctg 33660 tggcagagag aaattgggct aagggaggaa ggggagttaa gggaataaga taggttagag 33720 gataaaatct aagtgtacgt gaaactaatg cttttaatac tttttagttt ttttcaattc 33780 cattcactcc agaaattttc atcgtatttg gtgatgaaat agatcaggat tttgcttcta 33840 ttttatcttt tctttttgtg taataattcc atatttttct ccttccattt tgcggtcata 33900 agacaaggca ccatttctaa aatattttgt ccatcagaca tactcgtttt tttgtttgtt 33960 tattttattt tgttttgttt ttgtttttgt tttttttgag acagagtctt gctctgtctc 34020 caggctggag tgcagtggcg cgatctcggc tcactacaac caccacatcc caggttcaag 34080 tgattctcct gcctcagcct cccaggtagc tgggactaca ggtgtgcacc accaagccca 34140 gctaattttt ttgtattttt agtagagacg gggtttcacc atgttggcta ggatggtctc 34200 aatcttttga ccttgtgaac cacctgcctc ggcctcccaa agtgctggga ttacaggcgt 34260 gagccactgc acctggccta tgctgttttt atatggataa gtcacacaga aggaaaggaa 34320 attgattttt gtacagatac tgtataccat gtgttctgtc aatcatttcc tgggttaatg 34380 cagaaagcat tattttcaac atgtaatatg tatattctga aatttggcag tgtagaaaag 34440 ccctaaattc taatgttcat aaactattag tcatatactt atggagactt ttagatataa 34500 agaagaacta gaagattatc tagggtggag tcttctcttt acagatgaac agagggaggc 34560 ccaggtaggg taagtaatgg tttttaaatg gcatgtaact agattcagga acatgactgc 34620 cagggaccta gaaacctctt ctgttgggaa gacttcccaa gagaagcaat acccacaggg 34680 aatttagagg taagtggaca ttagtgagat gaagccctgg aaggggttgg ctgtgttgag 34740 aaccatttca ggctcaggaa gcagcaggtg ggagaagtcc tgagtctgga agcggcaagg 34800 cacattcaag aaacagagat gtccgaacat gcaaaatggc tggaaaagaa agcaagggta 34860 gagtcatgca gaagcttgca ggcccagtta taggttatgt gcaagcggag gccgagaagg 34920 gggtaacaaa cacaagtagg aggatgcaga gaggggcttt gaagatcatg tttttaagta 34980 tcacacacct gaaattaccc ccagagagca gcaattcagt gcatgggctt tgagattaga 35040 ctcctagagc tcaagtctca tctacttagg taggtaggag taaagttgct taactactat 35100 gtgcctcatg gtactcattt gtaagtgggg gttaataatg ttacccacct catagagttt 35160 tgtgaggaat aaatgagaac atacaagtgc ctgctactac gaagaactct ataaaagttt 35220 tctgcaaaat gtggtaatag taatcataat aattatagta atcgtttcaa ctatcttttg 35280 agcacagcaa agagcattgc caatttgaaa tccccaaagc cccaaaaaac cacgtttttc 35340 taaaagtttg gcacacactt ttttggtgac agaacctgac ctgaatgaat gtgaaactat 35400 ttatggtctt tacttattgc acttgctgtg aatattcata cgtttccctg tggaaaatat 35460 tgctgtattt gattctgggt tctgttccag actgtgctgg gagggatata taaatagcac 35520 tgtattctgt tctgaaatcc tgaaaattct gacttctaaa acacatacga ctgccagggt 35580 ttcagataag gaactgtgaa tctgtatcat ctcactcaat cctcagacta actcttgata 35640 tcgatattat catcccaacc ttgctgccat gaaaaccaag atttatcaat agtttgaata 35700 atttgctcaa aatcccactg ttcataagag atggaattgg aattttggtc tagaattgtc 35760 tgacaacact ttgaagagtc agacacattt gtaaactttc ctatctctta tttatgtcta 35820 ttttttcagc atttgcataa tttcaatttt ttattttcta aagagccata gttgatgttt 35880 ccaggaaata gaagcatttt cataacctga tgattttcta ttacctagag gttttctaaa 35940 taagggactc ctttaataat gatggtagta agaataattg ttattattcc tattttaagt 36000 cagctataat ttatcgagca cttctgtgtg ccaaacattt agcaaagcac ttggcacaca 36060 ttatctcata taatccttac atcaacccaa tgaggtaggt gttattattc tctgttttat 36120 tagaagaaga ctgcaactta aagacattaa ggaattgcca gctgcacaca gattgtaagt 36180 aacagagcag gaatttgaat ccagtagata gattgcaggc actgtattct tttttttaaa 36240 gctcctcctt tgttttataa tgattaaaat aacacataca tatacacatg gcaaaaacta 36300 caaacagtaa aaaaaattat gcaatcttct atcccagaat ataatgtaaa aatatcacct 36360 ctcacagtca ccaccaccta ctttcttctg tatcatgcag aaatttgtct ttgcacgtgg 36420 gcacgtgcgc gcacacacac acacacacac acacacagag ttggtgttta ataccgatgt 36480 gaatatacac attggtctat agcttacctt tttcatatag caccatattg tagaaaattt 36540 ccataacagc acatagaaat ctaaataatt gttttacatc tacttaatag tgtatggatg 36600 taacaaattt attgaagcag actcctatgg atgaatattt agatcgtttc caatttatgg 36660 ctcttgcgaa tgtgctgcaa gtgaatattt ttacaaaata ctttttcata gaggttcaag 36720 catatttcta aaataaattt tcaagcagtg gaaaatttat gctgggtgaa aaggacattg 36780 aagtttaaaa aactggaaaa tattgacaaa tttcccccca aaaatatttg tttccccaac 36840 agtgtgtatt attaaacttt ttaaaccctc acacatttga agatgaaaac tgtaatgttt 36900 tagtttcaat ttaaatttaa ttattggcaa aggtgagcac atttttctgt ttacaaaaca 36960 cggttatatt tctaacattg actattcctt tcctttatcc ctttttccat agcttctcaa 37020 atattttctt attggttttc aggagctctt tatacattaa tgagatgagt ctgtcctcta 37080 tgttgcagat atttttctga atttttaatt tgtcatttaa ttttatatat atatatatat 37140 atatatatat actctaaaga agttgtaaac ttttatgtag ttaaatgttt caatcttttc 37200 ttagagccca cactcctaac cattacgcca aactgcctct ctaatttcca gaatgatgac 37260 attcattcca gttttgttgt cacttttgtt caggtgtttt aacagaacaa gtggcaggtc 37320 ctctgggaca gaacctggaa gtggaaccat actcgcaata cagcaatgtt cagtttcccc 37380 aagttcaacc acagatttcc tcgtcatcct attattccaa cctgggtttc tacccccagc 37440 agcctgaaga gtggtactct cctggaatat atgaactcag gcgtatgcca gctgagactc 37500 tctaccaggg agaaactgag gtagcagaga tgcctgtaac aaagaagccc cgcatgggcg 37560 cgtcagcagg gaggatcaaa ggggatgagc tgtgtgttgt ttgtggagac agagcctctg 37620 gataccacta taatgcactg acctgtgagg ggtgtaaagg taagcatctt tgattggcag 37680 ttttctcctt caggttttac tatattggtt gttgaaatcc cttgaactaa ttgcttccct 37740 tttcccaggc aacttcccat tttctcctcc tgtgcgccta cctcctccta catcctcacc 37800 tttgttgtgt agtcaaagat ctgtaggaaa agactgttag agtatttagc ttaatgttta 37860 agactgttag agtatctagc ttagagtgtt tagctttctt ccttttcaca acctcacacg 37920 tagctaaagg gaacttatga ttaaattgaa ttttttgtag gttttgtttg tttttgttgt 37980 ttttttaaaa aagaaagaag aaagtgatga acacatctct tttaaaatta agatctagaa 38040 gaatacaact ttattcaata aatatttaaa aatatttttc atgtacttta gttaattgag 38100 aaagagagaa agaaagagcc aagtaataat agcttatatt acaaaaaaat attttaactt 38160 ggcatggtgg ctcatgcctg taatcccagc actttgggag gccgaggcag gcagatcacc 38220 tgagatcagg agttcaagac cagcctggcc aacacggtga aatctcgtct ctactaaaaa 38280 tacaaaaatt agctgggtgt ggtggcaggt gcctgtaatt ccagctactc aggaggctga 38340 ggcaggagaa tcgcttgaac ctgggaggtg gaggttgcag tgagccaaga tcgcaccatt 38400 gcactccagc ctgggcaaca agagcgaaaa tccatctcaa aaaataattt tttttttttt 38460 tatttagcct aggcatggtg gctcatgcct gtagttccag cactttggga ggctgaggtg 38520 ggtggatcgc ttgagtccag gagttcgaga ctagatagag caacatggga aaaccccatc 38580 tctacaaaaa atacaaaaat tagtcaggtg tggtggagca cgcatgttgt cccagctact 38640 tgggaggcag aggtgggagg atcacctgag cccgggaggt tgaggctgca gtgagccgtg 38700 atcatgccac tgcactccag actgaatgac agagagagac cctgcctcaa aaaatatata 38760 ttttatttag cagtaataca gttctgttat ggatgctata gagaaatcat aacaaaattg 38820 caatccattt aagatggata aaatataata aatgaaatat aaaaatgaca atgtaggccg 38880 ggcatggtgg ctcacgcctg taatcccagc actttgggag accaagacgg gtggatcatg 38940 aggtcaggag aatgagacca tctgactaac acggtgaaac cccgtctcta ctaaaaatac 39000 aaaaaaatta gctgggcttg gtggtgcatg cctgtagtcc cagctactca ggaggctgag 39060 gcaggagaat ggcgtgaacc cgagaggcag aggttgcagt gagccgagat cacgccactg 39120 cactccagcc tgggcaactg agcgagactc cgtctccaaa aaaaaaaaaa gacaatgtaa 39180 aatcccatgt gattactgtc tcaactaagg cacagataat cgtggtattt attcattaaa 39240 caaatatttt gggaacacct accatgggtc aggcactctg ctaacatcta gaacattgaa 39300 tactgaataa aaaccatctc ttgagcagaa tctttcggat ggtgtgacac actggggttc 39360 cctgatgttg gtggttgcca cagaaataaa cagaggaccc agtcacataa gatcggctca 39420 aggacatttc aaacctttga aagttcaaat gatcagattg tgtaaagcca cattagtcac 39480 ttcttaaaaa ctgggtatga tattgttaga atttttttta aataaagaag catttgtaca 39540 ccaactttac aaactaaagc agatctataa agaaaactta aaagatccca cttatggcct 39600 tcagaatgtc cttaaccttt ttgtggagag tcacgtctga ataggtaaat gaacttatac 39660 attcgttgga agcatagatt aagtcttctc tacctatatt gctactcata gcacctagca 39720 cagtgcctgg tacatagtaa atatggaatg tcttttgaat gacacaatga aaaaccatgt 39780 tcatcactgg tacgtaatta atcagcctct actttgcaaa aacacattaa aatttttttt 39840 tacaatactt aattctttcc catctccgaa tgataaaaag taaatattac cctcgattcc 39900 tatatgaagt ggcagatctt tttgagctgg cagagctgtg gagatcgaga tctgcttatg 39960 ttaaagattt taaaatggtg tggaataaag caggaaaaca tttctaattg gttagaaaaa 40020 taaaataggc cggacgctgt ggttcactta tgtaagtccc agcactatgg gaggccgagg 40080 tggggggatc acctaaggtc aggagttcaa gaccagcctg gccaacatgg tgaaacccca 40140 tctctactaa aagtacaaaa attagctggg catagtggca cgcacctgta gtcccagcta 40200 cttgggagtc tgagacagga gaattcctcg aatccagaag gtggaggttg cagtgagctg 40260 agattgtgcc attgcactcc atcctggatg acagagcgag actccgtcaa agacagaaag 40320 gaaggaagga aggaaggaag gaaggaagga aggaaggaag gaaggaagga aggaaggaag 40380 gaaggaaaaa ataaagtagt ttccaaatga caagccttga actggaatta aaataaacaa 40440 aaataaaact tctacagtta ggtttgagaa accaactgct gatctacagg aaagaaaaag 40500 aacaaacttt atagtagctc tagtcaaaaa gtggctgggg ctcttacact ggagggcaaa 40560 ttaggaccaa aagatgaagc ttataggatg gcatcttttg aggcctaatt caagaattaa 40620 ttttttagca ctaacatgaa ttgtcccaac aatgagattg gctttttgcc ttaggtggta 40680 acgaacttct cattattaca ggaattcaaa aagaggttag ataaacaggt ttgagtacag 40740 gagatttgta tgttaggtgg gacactgtat tagctggctt ctgaagtcct tccttaaatt 40800 ctgtgaatct gtagtaccat atttataata ttctaatctg cctgtttcat cttctaagca 40860 agtttataaa tttcataagt taaaaagtga ctataattca tgtactccct cctgttgcta 40920 tgaactgatg aaagggaaag ctagaccatg gtttttaaac cttgacacta ttgacatttt 40980 aggctgaaaa actctgttgt ggggggatgt cttgtacata ctgtgggata tttagtagta 41040 gccctggctt ctatccacta gatcccagta gcaacccccc acttgtgaca accaaaagtg 41100 tctacagaca ttgctaaatg tcctctgggg ttacaatcac cccagttgag aatcactgag 41160 agagactatt cagtgaaaag tataatatag agcttctcaa acttgagcta cacctggaga 41220 tttcttgagg gaagctataa tatttcttgg gtgcctctaa tgttgactta aattgctttt 41280 aatgaattac ttatgctatc aagcaattat agcataagct atattatatt ataatagcta 41340 tattatattg tgtatcatct tttatacact gaaaaatttc tatacacaaa ggaacaaaca 41400 caaatttaca aaattacata caaacaaaac tacaaaacca ataaaataat gaagtatatt 41460 agattgtatg gttttgttgc ccaaaatgtt accaaactca ggttcttttg agtttggtat 41520 cactagacca tgtgtcatat gaggcctcaa ttctccactg cttttctctc ttcgaagaat 41580 tgattcaacc aaacctttgt tcatccagag ctccatttgg ccatccctga gctgggtaca 41640 tctcaggcat atgaagtttg cctttgccat tttgtcaaag cttttttttt tttttttttt 41700 ttttgagaca gagtcttgct ctgttgccca ggctggaggg caatggcatg atctccactc 41760 actgcaacct ctgcctcctg ggttcaagtg attctcctgc ctcagcctcc ccgagtagct 41820 gggcttacag gcgccctcca ctatgcccag ctaatttttg tatttttagt agagatggag 41880 tttcaccatg ttggccaggc tggtctcaaa tccctgacct caggtgatcc acccacctca 41940 gcctcccaaa gtgctgggat tacaggcatg agccactgtg cctggcctgt catagttctt 42000 gacaaacaag tcatatttct tgctgctaat gggcacaaac aaaactgagg gtgattagtt 42060 tgaatatcag tataagatga tcatagatat gatccagaga atgctattat tagtttttta 42120 aagactggca taaaataaaa atgtaaaaaa taaatttaat atgtgtaaag gccttagaaa 42180 aaagtgtggc acatagtcaa aactcaatag atgtctattg ttattattgt tatggtccta 42240 gaattgattt acatatcgtg taatggcaat gtgagtgttt tctgcttata ttgataaata 42300 ttgatgctca aatcatcgtg agttgtttaa caagatttat ttgacataga ttcagattac 42360 ctaaagccac ttcaacttgt ctgtgtagac cagtgctact caaagcatag tacctggacc 42420 agcagtatca gtatcactag agaatttata gaatgtgggg ttttacccca gacttactga 42480 atcagaattt ctgggaaagg ggcccaagaa tctgtgattt aaccagcact acatgctaaa 42540 gtttgagaat actggcgtaa aaatcatatc caaaatcttc ctaaggccac tcatctggcc 42600 aagcatcttg gttttgtgtt gtgagtgtga aacccacagg atgggaaaag agaataggag 42660 gaatgctact ctccatgagg tcatcaaaag cacaattcag tagaataaaa tgtagacttg 42720 aaacctggac tggaaagaca caaagcctaa tctaagtcat gtttcctgct gggagagaaa 42780 gaagcagaaa agaacacgca taatttattc ggaaaatctg attcaaaaga aaaattccaa 42840 tatacatact tttgaaccta caaatttcag tgctgatgac atgctaattc tttttttttt 42900 tctgaaacgg attcttgctc tgtcacccag gctggagtgc agtggcgcga tctcagctca 42960 caaactccgc ctcccgggtt cacgccattc tcctgcctca gcctcccgag tagctgggac 43020 tacaggcacc tgccaccacg cccggctaat tttttgtatt tttagtagag acagggtttc 43080 accatgttag ccagaatgat ctccatctcc tgacctcgtg atccgcccgc ctcggcctcc 43140 caaagttctg ggattacagg cgtgagccac cgtgcccggc cgatgacatg ctaattctta 43200 tggtaggcat tttggaaaca ttctctctga tctttacagc aactatggaa agtaggaatt 43260 attgtgtcta ttttaaaatg aggaataagc atatcatttc caaaacaagc taagatatag 43320 agatgccagt gctaatagct tgtgtgatgt gtttactata ctttagtaat acacttatta 43380 attctcatat cttatgtcca ttttatttta ttttcttaaa tttatgattg acataataat 43440 tgtacatgtt tatggggtac aacatgtttt catacatgta taattgtgta atgatcaaat 43500 cagggtaatt agcacatcta acagcttaat ttatcatttc ttcattatga gaatattaaa 43560 aaacctctct tctagttatt ttgaaataca cagtacatta ttattaactc tagtcatgct 43620 gctatgcaat agaacatcag acatattcct tctatctaac tgtattcttg gacccgttga 43680 ctaatttcac tgtccccttc tccccccatc ctcctcatcc tctcataact accattctac 43740 tctctatttc catgagaaca acgttttaga ttctacatat aagtgagatc atatggtatt 43800 tgtctttctg tgcttggctt atttcatttc atgtaatatc ctccgggctc atccatgtta 43860 tcacaaatgg caggatttca ttctttttta tggctgaata gtattccatt atgtatacat 43920 accacatttt cttgatccag tcatccattg atgaacgctt aggttgatcc cctatcttgg 43980 ccattgtgaa taatgctgca ataaacatag gagtgcagat atctctttga catactgatt 44040 tcatttcctg tggagatata ccctgtggtg ggattcctgg attatatagt agttacattt 44100 ttaattttgt gaggaaacat catactgttt tccattatga ctacacaaat ttacactccc 44160 accagcagta agggttgact tttctctaca tccttgccaa aacttgttat cttttgattt 44220 attgatagtc attctaaatg gagtgaggtg atatctcatt gtagttttgc attttcctga 44280 tgattagtgg tgttgagtgc cttatgccca ttttatggaa gtggaagctg aagcataagt 44340 tgttaatcta tttacccaag ttttcatagc taataagagc ttcaaaacta tagcttataa 44400 tctaactcca gagcctgtat gattgaccac taggctatga tgccttttat tggttgtcca 44460 ctaagtgatt gttctgctct tcagttgttt gctcaagggt aatattgact aattcaatgg 44520 cgagaggaag aaaacgaagc agtccgttta gctgttaaag aatcagtagt gtctaagcca 44580 agggaatgaa tccctaaagt cactaacctt ggatagctta cctccttaga agagaatttt 44640 tctttctgtc ttgtgaactg tgcttatgac caagtgaaac ttggtccacc tgcgaagagt 44700 tctggggcat ggcagggtaa atagagtgag accagtgggc caaggacaac cctggaaaac 44760 acttacctta aaggtggtca ggggaagaga agcccacaaa ggatatagaa tagaagtaat 44820 caagatagaa gtctacaagg aagtgcagga tcacagaagg caaggagatg gagaaagtag 44880 tcagaagaga gccaaccttt ggggaggttt aggaaggaga agctggattt ggacataaat 44940 atatttttaa gagcagagaa agacaaaatt tttccattga ttttgatttg agacgttgcc 45000 cccatggtgt caggaaaagg acaagcatgc aagggagagg atgggttgga accaggcagg 45060 gcccatgttc tatgccaggg attggcacat ttttttctgt aaacatccag acaataacaa 45120 gttgaggctt cacaccacag cagcctctgt tgcaactaat caactctgct actggagtgc 45180 aaaggcagcc acatagaaaa tgagcatagc tatatttcag taagctatac ttacaaaaat 45240 aggcaggaaa aaaaatgaca gtgggctggc tgcagtttgc tgaacactgt tcttgaccaa 45300 tgactttttt tttttttttt ttttttgaga tggactctcg ctctgtcacc caggctggag 45360 cgcagtggca atcttggccc acagcaacct ccacctgggt tcaagtgatt ctcctgcctc 45420 agcctcccaa gtagctggga ttacaggcac cagccaccat gcccagctaa tttttgtatt 45480 tttagtagag acggggtttc accatgttgg ctagggtggt tttgaagtcc tgacctcaag 45540 tgatccaccc gtctcagcct tccaaagtgc tgcggttgca ggcatgagcc accgtgcctg 45600 gccttgaccc atgactctta agggttggtg tacgaattaa ctggaagctg tgtagtgaat 45660 gagcacaaga ttcaactcac agcccctatg tgagagaaat gtaactttat tttttaaaat 45720 tggatttaat tcagtctaga gctcggtgga aactttattt cattttattc agtatcattc 45780 cattcacccc cgccaaaatt cttctggcgg tttatctaag taccaggatt aagggagggt 45840 tgtggaggga ttgggcactg gggagtgggg ctctggcccc acagctgtgg ggctgggtgg 45900 ggcttggtgc cacagctgtt gacacagtct gagatctgtt ggagtgctgt tgtatttggt 45960 cttcagtcac cagtgggaaa ggtctctgtg acaccatcct ggggtactga gcacactctt 46020 ccatcacgtc tggccctggc tgtctccttc attcagctca ggtgggtggg ttctgatgtt 46080 gcatcagtgc tgtgttggga tgcctggctt cccagagaaa ccatgctgcc atttcatctg 46140 ctctatgcca tgctgggtgc agagaaccct gggggtttct ccagttctac ccaggaaagg 46200 gcataagaat cctcgtcttc caggatccca aatgacgtgg agtttctata atctctcaat 46260 ccacctatcc taggattctg tctggggata gggagttggg agggacagag ctcaggacct 46320 tctcaggtac ctaacatgtc ttagctcttt gacttcccct ttagttgtct gcttcctccc 46380 gacttgaata agttttatct ggtcttcagg attgggggct taagctctgg cgtctgaaaa 46440 gccaaaaaca ggctccttta tcttccttat gtgttctcct gtgaaaacaa atgaaaatta 46500 tcctacctta gtgaatggag gaatgaggaa gggctgtgag gaaaaccaga ggagaaggac 46560 agaatagagt ggaatgcata tcagaggacc aatgcctcat gcctgcaggt cttgtgagag 46620 ctgctcagcc aacctggttt ccccgctgac cccgcagccc acttccccca tgcacatctc 46680 ttgatatctg cactcagcag atgcaaagcg aattatactc tgtaactttc catcaaaggg 46740 agcacgggct ttgcgggctg acaagctcac attgaatcta ggctctgcac ttcttggctg 46800 aataaacaag ttacttaata tttactggtt agagttcagc aaaataaaac agaaaccaac 46860 aattcaagca gaaaagaatt aatacagaag ttggaagagt cagagtaaca ggatctatgc 46920 taagtttctg ggagatgctc agagccacac tgcagaactg gcccgccggt tgcttttctg 46980 ccaaaatcaa gaagatgggg actcaggaag ccacctcagg actgttggct ccaggaagac 47040 accacctcag ctgccatcta gggatcagga agttgccttc actggcaatc actaactcca 47100 gagacggggc ttgcctgcca aatctgcacc tgcaaacaat gaataccctg tcccctgcct 47160 ttcaataccc tcaaagctag tgactggaag ccagagtgct gccacagcaa aactccacaa 47220 tcccgcttac cagtagaaac aacggaagaa cagcttgctt ctcctttcca aaagtcttac 47280 aagggcatct ctatgggaga gcttaaatca tatccaccat gttagctgcg aaggaagtct 47340 gggcaaggtt atgttctact ttcctgcctc tctggtaata ggatgtgcac tagactagaa 47400 gggaggtgca gaagatataa agctagcgaa ttcttaagac cgtgcaatat ttattcctct 47460 gagcctcaga tctgcaaaat agggatatta tctattttgt gggacatttg tgtggattgt 47520 acaagattta tattacatta tattatatat tagaatttac cagctctctt gctttagtat 47580 ttcactgaag atttgtcagt cctgtgatac ttaaaacaaa caaagaaaac acaagagaat 47640 ttaaaaaaaa aaagcaacct tagaaacaaa acaaagatca gggttaaatt tcctcattac 47700 attggtctct ttttctggtt ccactaaaaa tattttaatt tgtctcagtg ttttatttgt 47760 cataagtatt tatagcatca atgagagaaa tgatggtttt caatttttgt aagtaataac 47820 ttgggaacac aagcatcacg cagaatgaca atgactgcca ctttctcagg actagattaa 47880 atcaagagct ttcttttgcc ctcagttatc agtgcagcat atggaaatgt gctaatgtct 47940 gaatgaattt tgattcaaac ctcaaacctc aaccaacaaa actgatcagc atctaactga 48000 aaatttcaaa tggctgcatt tccaaatagg tcagtgtctg tcacatttgt ttcaatgcca 48060 agagagtctt cacacatccg agaatggtag aattcctaaa agatgctgtc tgagtaaatg 48120 agtgacccta ctgcctttct gaatatattt ttgtgtagga ttgtgaacca gcattagact 48180 taggtctgaa gttttaaaaa agcttccaaa agtattgaaa ctattccagt atagaataga 48240 tgtagtgttt ccaaaggatc ctttgtttat ggacatttgt gaaagctgag gagtgctgac 48300 ctatatataa acttattgac atcctattta agatcaagca aggtctgatt ttatacaatt 48360 taataatgat cctaatgaat tctagaggtc aaattatctt ggagtatgat gcaagatggg 48420 gttccaaact atttcactgc attggaaccc ttttgtagta agtgttatga aaatcgtggg 48480 aagagtagga ccctgaaggc aaacaacatg ctttccaatc ccagctgatt ctcttcccag 48540 gtgtcctaaa ttacttaacc tttctgatct ttagactccc cacctgtaag ataaggacaa 48600 caatgaagat gagataaaga tgaggagatg atgatgatga tggtgatgat gataatggtg 48660 atggtgatgg tgatggtggt ggtgataatg atgatgatgg tgatggtgat gatgatggta 48720 attatggtaa tgttggtgat gtgatgatga tgatagtaat aatcatgttg atagataatc 48780 tatatcaagt gattactatg tgcaggcact gttctgggag gctttatatg tactaatttg 48840 tttaatcttc aggacagtct tttgaagcag gtatctgtag attgtcatga ggattaaata 48900 aattgatacc tgtaaagtac ttaacagagt tcttcccaat ggaaaatcct caatcattgc 48960 ctgctattat tactattatc attgggttgt tgtgaattta aaaaaatggg atattatttt 49020 taaaatatct agctctgtac ctggcactct ttttaaaaag tttttttttt tttagatttc 49080 aatagctttt gggggtacag tcgtacaaat ggttttttgt tacatgaatg acttgtatag 49140 tgatgaattc tgagattttg ttgtacctgt caccagagtg tatacatttt atccaatatg 49200 tagtttttta tcccacactt ttctcccctc ctcccttttg agtctctgaa gtctattata 49260 cactctgtat acctttgctt actgatagct taggtcccac ttgtaagtgg gaacacatgg 49320 catttggttt tccattcctg agttacttca cttagaataa tggccttctg ctctacccaa 49380 gttgctgcaa aagacattat ttcattcctt tttatggctg aatggtattc catggtgtat 49440 atataccaca gtttctttat ccactcattg gttgatgggc acttagattg gttctatatc 49500 tttgcaattg tgaattgggt tgcaataaac atacatatgc atgtgtcttt ttatctaatt 49560 actttttttc ctttgggtag acactcacta gtgggattgc tggatcgaat ggtagatcta 49620 ctttcagttc tttaagaaat ctccataacg ttttccatag aggttgtact aatttacatt 49680 ctcaccaaca gtgaaaagca ttcccctttc actacatcca caccaaaatc tattattttt 49740 tgacttttta aaaatgggca ttcttgcagg agtaaggtag tatctcattg tcattttaat 49800 ttgcatttcc ccaatgatta gtgatactga acatgttttc atgtgttttt tggccatttg 49860 tatgtgttct tttgagaaat gtctattcat gtcatttgtc cactttttga tgagattact 49920 tgatttttcc cttgctgatt tattgagttt cttatagatt ctggatacta gtactttgtc 49980 agatgcatag tttgcaaata ttttatccca ttctgtgggt tgtctgttca ctctgatgat 50040 tatttatttt gctgtgtaga atctttttag tttaatcagg tcccatttat ttatttgttt 50100 gtttttgttg catttgtttt tggggtttta gtcttgatgt acctggcact cttacatgct 50160 cagtaatcat tagatgtgat tttgttggtg aattcttatt tgttattcct ctgttcattc 50220 aacctcccca gtctgggtca tacactgttt tcaaaactct ccctcttcct attagttaac 50280 ccaattcttt attcagcaaa tattattaag taccagtttg tatgccggag tctattctaa 50340 gcataggaga tgccttggtg agcaaaccag accaaattct tgccttcata gtcaccattc 50400 taaagcaaaa gcagaaaaat agtctacata actcagctaa aatagacaaa aagagaaaac 50460 agtcaagcaa acaaaaagat taagaaatat aaaatcagta aaagagtatc gaaaagaata 50520 cgtagaaaaa tgcaatggta ataaaatgat ttccagggaa tatgaactga catttcacat 50580 aaacatatag gtgatgaggc aagacaaact aagatgataa aaactaaatt taaacaagaa 50640 aaaatgttta aagcaaacat ataagcaata tgtataagta gtggttaaaa gaaacgtctt 50700 aaagcataaa ttactaaatg agaaaagtta taaatctaca ctcaagaaaa aaatggaaat 50760 aggcatttaa gcagcagttt ggaggtgaaa ggtttaaccc cacagagcaa acgaattcac 50820 ttgggctcag gtagctacga gcagcatcag gcacctccca gtgctggaat tcacctatta 50880 gaaacccaga acacccctag gaacctcagg gaagatgctg gggaccaggt gccggtcact 50940 gtgggccact agcttctggg ggtttgccta cctggcactt taaatggtca ctgaaggtat 51000 ccaaagccaa actccagcac agcttggctg tcctcatcat aacccctgtc ctcatcctct 51060 cagcagaaac ccagtaggtc tgatggtgtc agtgggtttt tgctttctac tcctttagcc 51120 tccttcatca cttaataagc attttgtcaa agaaatgagt gcatgtgatt ccaaagtcaa 51180 atggtaagaa gggcatatca ttagaaaaaa gccccactcc taccctcttc aactgctcct 51240 agaggcaacc cctaagaacc acttcagttt tctttgcttt ttgtatttta ttgtttcttt 51300 ctttgttgtg tggatagtct ttatttattt ctgtagtacc atcttccctc ccaacaaaca 51360 tggaagaaaa aaagaagggg ctggtgggaa gaaatgccct gggggacaca cctctccaat 51420 ctctgcagtt ttgaagaggg agtgttgagg tggaagtctt ggggatatgg ggaagatggc 51480 cagcccctcc tcttgtctcc caggtggctc ttggcagcct tttaaagctt aaactcatca 51540 gttattttgg agacgatgct tttaaaagac ttggaagtct gggatatatg cttaaatgga 51600 aacttcttga gatatctcca aagcaatttt cacacctcca tcttccactg tactgagttc 51660 tcttggtccc actgtggctg catgcacagc agcatgcagt tcttgtacag tctctggagt 51720 ctgaaggagt tccattttgt cttatcaatg atctgcacag ctacctcttt cctagttagg 51780 gtgtgtcagg ccaactgcat ggtgaccaag ttaccctggt tgatggtcgt gatgagccag 51840 tagttgccaa tatgggtatc ctcagcagag atgcctgaga ggtcctgcag tacgttggag 51900 atagggcttg gagtcaaggt gtcccgaggt aggcttaaat ttgggaaaag ctgattaaaa 51960 gcttggtcca aatcatgtca aaagaatata acttccaaaa tccacctgat gacctaatct 52020 ttataatgaa atcaaaacac aatgccagag aaaaataaaa ataaacaaaa acaaatgcta 52080 acacactaaa tttaaaaaga aaaatgagaa aaaattctta caaagtcttt gtatagtttt 52140 ggtattaggg taatgctggc ctcataaaat aagttagaaa gtgttccctc tgcttctatt 52200 ttctggaaaa gattgtacag aattggtatc atttcttctt taaatgcttg acagaataaa 52260 ccagtgagac catctgggcc tgttgctttc tgttttaggt tattcattat tgattcaatt 52320 tctttaatag tataggccta ttttgatgac caatttctcc ttgttgagtt ttggtagggt 52380 atgtctttcc aggaatttca tctaagttag caaattagca ggcttggagt tgttcatatt 52440 tctttatttt tcttttaatg tccatgggat tagtaacgat gggcctctcc ttcatttctg 52500 atattggtaa tttgtgtctt ctctctctct tttttcttgg ttagccttgc tgggcttggt 52560 tagcctagcc ttattaattt tatagatcta ttttattgac tctattaatt tttattgatc 52620 tatttaaaga accagctttt ggtttcactg gtttttctct atcattttct tactttcaat 52680 ttcattgatc tctgctctaa ttttaagtat tttttttttt tgcttatttt ggatttaatt 52740 tgttcttttc tagtttccta aagtggaagc ttagattatt gattttagat cttaatcaat 52800 atatgcattt taatgctata aatttctctg taaacactgc ttttgtcaca tcccataaat 52860 tttgataaat tgtattttca tttcccttta gttcattttc acttagttct ttaagtagtc 52920 acctctatgt cttctatttg cattatgtgt aaccttttag aaacctttgt gatgtaagat 52980 atgttatatt cctatatttc tctaaatcaa agttaatctc ttctaagctc acaactctcc 53040 cggctgatgt gaaattcatc tggtctatgt ttgagaatac tccttcctcc aggatcgtgc 53100 gcaaacctca ggatcatgtt caactttgaa ttggtccctt caaagtgaac aactcagcca 53160 cagtccagca tggacatctt accccagggt tttcaaggcc atggttctta gtatcttcct 53220 gtccttgtct cagagctaca tcattagaga tctgtcactc ctctagaagc tctttattag 53280 tgctggttca atactcacag aacctcccag actcatccca ttcccaattc tacaagcatt 53340 gcccttcacc tccaacacct ttgagatgat gcaatataac cgtttcaaag agaggtcgaa 53400 aacaaaaatc ctatgtgaga gttatcttta tttagagaca gaaaaattgc agcttttgtc 53460 ccagcaagtg aacaagagtt tccgccctct tgggatggaa aagaaaatga aaaagcaata 53520 gaagcaagaa aaaagcacac ataaatacct tgatcagtta tcctttcatc ttaatgctcc 53580 tacaattatt gcaaaacaat catttatgaa ctcaaactgg ggacggtaac aaatttgtta 53640 tgggctgggc acttgccatg ctgtgtaaaa cctgggtaga ttcatgtctc aagataattt 53700 ttaaaaatgt aaacttccta ctaaaaggac agcataaggc agtggttaag agcacagagt 53760 caggtgggca cagtggctca tgcctgtaat cccaacagtt tgggaggttg aggcaggagg 53820 attgcttgag cccaggagtg agagacctgc ctggacaaca tagctagact cagtctctat 53880 aaaaaattaa aaattattaa tagctgggca tggtggggta cacccatagt cccagctact 53940 caggaggctg aggtgagagg atcgcttgag cccaggagtt tgaggttgca gtgagctatg 54000 attgcaccat gcaatccacc ctgggcgaca gagcaagacc ttgtctaaaa aaataggaaa 54060 taaaaagagt gtagggtgtg agcttatcaa ggttcacatc taaactctac catttgctct 54120 ctgcaggacc ttgagcaaat taacttcttt gtaactcagt ctcctctttg gtaagaaaga 54180 gataatagta tgtgatgttc cccttcctgt gtgcatgtgt tctcattgtt caattcccac 54240 ctatgagtga gaacatatgg tgtttggttt tttgtccttg cgatcgtttg ctgagaatga 54300 tggtttccag cttcatccat gtccctacaa aggacatgaa ctcatcattt tttatggctg 54360 tatattatcc ctaaaactta aagtataata ataaaaaaaa agaaagagat aatagtagta 54420 tctgcctcat tggattgtta tgaataggaa atgaattgcc atttaccatt ttagaacagt 54480 gcctggcaca aaataagtgc tgtataagcg gagacagacc tagaagctgt ctctatttag 54540 gtcatcttga atggaaggat ctctagttac tcacctctgt gtgtttgctg ctattttaaa 54600 agcctgtaga cccttaaagt tttacaacat caaccaatat tttattgatt gcaaagtaat 54660 gggtggccta ctaccagtgg ctatcagtat cagtcactgg agtctggcac gtgcatgtgt 54720 atgtacacat atatgtgtat gacttgagaa tagggtgtgg tatgtggctt ctaccagtat 54780 aaagaagcat gagaagtgcc tgaactcagg aagcagaggt tgcagtgagc tgagatcacg 54840 ccactgcact tcagcctggg caacagagtg aggctccatc tgggaaagaa aaaaaaaaaa 54900 gaggcagaaa agcagtgtgc aatattctgc agaatcgagg aatccatgta aaagtaccat 54960 cctatagcta ttttcttgct accccaggga tcccattggt gctacacatt ccccagatgg 55020 caatgccaac cctgtctact tatcctgtat taccagcaag gctctgttcc tcagtgaatt 55080 tcctttccat tcattttcaa atgacttttc tgatatgagc attgttgaat atgcacaaga 55140 aagaggcaga ggtatgtgta tggatagatt tgcataagta taaatacagt atatgtaccc 55200 atatttaaat gatataatga taatattgtg ttgctattca gagtgaaagc taattaattc 55260 agctattgag atgataatga tttctttttc agttaatacc agagtcaaaa aaccctgcag 55320 aatggttcta cgcactgttc ctttgactca tcttggaaga aagaccttta ttctgttaac 55380 tattaagctg tctatcagta tctatgacag ttttaaagga aatcattcct gggaggggta 55440 gaagattatt agtaggaagg ctgactcatg taggctcaag tggacattga ggtcatgatt 55500 tggtaccagg aaagaaaaag tcctcttttt gcttttagtc tcttaaaatg catgctttgg 55560 tatttgcact ctcaatatca acactgatct gcagcacaaa gtcttttttt agactttatt 55620 tggttggatt tgacatttag aaaaatgtta agatctttcc tactgaaatt tacaccagct 55680 gaaaaattca caagcaagtc catgactttc ttaaccttga catctctaac ttttccttca 55740 gcctctgttt tcatatttca tgagttacca aaacctgatg ttctctctgt ttcctccttt 55800 ctcttctgtc ccactttagt tcaggcattg gtagcctctc agttgtatta tggcaatagt 55860 ctcctctctc tagtctccaa tacgcgttcc acctttacct tgctccgagg agcttcccaa 55920 catacaggtt aggtcatact gtcccctgtc aaagccttta atggcttccc tctttcaaca 55980 tggtaaagac tcaactcctt tgcttggcat tcaaggacac ttaatcacat tctgttcctg 56040 ttacaactgc ttacccatct cccactaatt ctctccatac tctacccaac cgccaccaag 56100 atggagtatc taccctttct tggaattttc ataacaccct gtgtttgctc atttttccga 56160 atgtctttcc tccctcatgg tccatgtttc taaaacctta cttattcttc aaagcccaga 56220 tcaaattcta tttccttcat ggacccctcc ctgattctgg ctactcttaa ttaatcaggt 56280 tcactcccag tctatacaat gccttttctt cggggcgctt gacttctagc agttgggaaa 56340 tcctcatatt atacttatct tgctagaaca atttttaagt gccttctgct ggaaaattgt 56400 cataagaaat atgcagctct acatgggtgc agtgtaattg ctgcctctcc tccttaaatg 56460 tggcacaatc tttacaagca tatggtgtgg acctgcttct acatttcaat aatttattcc 56520 ttgtttagct atctcatcac tcatatttta tattatgctg taaatttact tctgtttatt 56580 ccaactaagt atgcaactct gaagacaagg aatgtcttat tcatcttggt atttcccagg 56640 tcagagaatg cagctttcag atagttggta cttaacagta aactatttta ctctcttgta 56700 acccaagggt ggcaattgca agatagtaaa gagtacaggc tttagtgcat gactgcctgg 56760 gtttgaacct tcctagccaa ctactaacca tgtgatcatg gccaagatac ttaaactctc 56820 tgtacctttt tttcttcatc tataatatgg ggacaaataa tactatcttc atcataaaat 56880 tgatataaga attacatcag taaatggagg taaagtagct caaacaatag ctactaggtg 56940 cttaataaat gttagatatt attccaacag tctgttaaaa taggaaggac tcctctcaat 57000 tttgacagag acatgaagac actagatgct acattccaag gtgagcccat ctggcagaaa 57060 tttcacagtg gatattcaag aaccagttaa acatctcact tattttgagg gaatgaagag 57120 atcaggagtt ggaaattata gagaggtctt ataaagattt tggagcatgt gactagtgtg 57180 ctgtgtgaat cttacatccc tactgcaaag gagaatgttg gaaatgggag tgtctgtccc 57240 tcgggccacc actggaacgt gtagtaactt tgtgagaatt ggaaaatggt gtcccctcat 57300 gcccccaaaa tgtatactaa agaggtcctg ttgccccaag cccaatgaaa gagtcttcaa 57360 gacgatcact tgagtaggtt gacatgtatc ccctgaagct gacagacatg gggactttgt 57420 gtctcacttt cacccgagaa gttgaaaggc aggaagccgg gtggctgact gctcatggat 57480 aaatgggaga gatttctctg ctccacagag attgttagat cagagaaagc cattatgggg 57540 ttatcttagg tccatgtgaa gagaaccaca tagagtgacc tcagtgtaaa aattgtggag 57600 tgaatcgcca gattcctaga tgttgaagga gacacatacc atgggctaga atagatatgt 57660 atttgccctc atcaaaaaag ttcagatgag gctgggcgtg gtggctcatg cctgtaatcc 57720 cagcactttg ggaggctgag gcgggcagat cacttaaggt caggagtttg agaccagcct 57780 ggccaacatg gtgaaacccc atctctacta aaaatacaaa aattagtcgg ccgcggtggg 57840 cggcgcctgt catcctagct gctccagagg gtgaggcagg agaatccatt gaacctggga 57900 ggcagagact gcatgagctg agattgtgcc actgcactcc agcctgggcg acagaacaag 57960 actctgtctc aaaaaaacaa aaaacaaagt tcagatgaaa gtatcccgaa gatgagaggt 58020 ctctaaacaa cccaacccaa tggagaaaga gacaactttg aaaacttgct atgcacaaac 58080 cagcttacag aaatactagg cgagtaactg ccttcctgtc atttcttctc ttacctctca 58140 caatcctctc ctgaattcaa cactgcctcc ttgactctga aagtgtcagg aactgtggtt 58200 agtcagatgg aggagaaagc agaaagtggg gaaatggagc agcagagttg aagtcggtta 58260 cctcccttct tcactctagg catcttacct gttaacaggc tgcagctgtg ctataccttc 58320 tcgggggaga tggttgcttt tagctcaagg ttggcatttt tattattatg ccaggctggc 58380 atataaagta ctgaattgaa accttgaatg cgattttctc cctgaatgcc tggttaacag 58440 caactcactt tccaaaactt agctcagtgc tcactcccta tcctgacctc ttcatcccaa 58500 actgaggaat ctgtgtgtct gctctgccgc agtgcagctt ttagagccca ggcaacatgt 58560 aatttcactt gtttagttgt ttgtgcgtct ttgtcctcag ccagccttta aactcttgga 58620 gcccaggggt gaattgtttg cctttgtacc ctctgcactt agtgctcact ggcatagaat 58680 tgtcccccaa taaagatcta tgaaagtaat ttgtcttaga aaacaaattc acatttttcc 58740 taagagaggc tataattcaa atttgtcttt atttctgaga gataaattaa aaacaacagg 58800 cttctcaaca acacaatgtc acataaagta tttgtcactg gtgtgctcaa ggatggcctg 58860 ggtgctctcc agcagttggt gagaaaatgt cctggcatct ctcaatccaa acctgttttt 58920 ttcctgagaa gctgtgagag gactttttac acttttcagt gtttctccca caggtttctt 58980 caggagaagc attaccaaaa acgctgtgta caagtgtaaa aacgggggca actgtgtgat 59040 ggatatgtac atgcgaagaa agtgtcaaga gtgtcgacta aggaaatgca aagagatggg 59100 aatgttggct gaatgtatgt atacaggtat tcacttcaag caattacatt tcactaaaaa 59160 tctcttaagg aggcagatgt caggtaactc atcacgagag tgctacttca catattaaag 59220 aaggaaccta ctaagccatc taagttcttg aaaagggtaa tgaaccactg aaattgtcca 59280 aactgtcaac taaagagcat tttagtcacc cgaaagaaat gggttttttt tgttgttttt 59340 ctttgtcgca gtgaactata catggagccc agtggcatta acataccatt caaataatgt 59400 gctttaatga gctacgaatg acatctaaaa gtttcctcta aatttgattt taatatactt 59460 attatttatt gcttaattag catacaaatt cactattgtg gaaaaaaatg agaatgcttt 59520 ataaggtata actgcctctg aattgtctaa atagagttct aaaccaagtc attttattaa 59580 ggatctgtct tctcttagtg cagctttgaa ccaatttgta taggtttaca caggaaatta 59640 actactccca gttttaatat aattttgtct ttctatctga taaagagaca ctgtattaac 59700 tgctggcttt gtgaggaact gtaatcttca tcaaaaattg gaatgcatgc tagtatataa 59760 cataattaat ttttcattct gcaaattcta cagaccactc agattctgag atatgtcctg 59820 tttctcctca aaacacaaat agacaaatgt caccaagggg atgacagcat tcaggcaggg 59880 aatgccaaat gaaataaaag ccttcagaag gtgactccag ttctattgtt accaaactgt 59940 acgggcaact gggatggacc cagggaaata atgcatgcag gttgggggaa aataggtctg 60000 tgtaaataat ccagctctgg gtggatgaaa attttatagt agctgatcac aatctcatct 60060 tagcatacat gaaatgagaa cctgcaatag gggctgctgc tggaatccaa tcttgcaatg 60120 gaaacactta ctctgacact gcacagctca tatagctcat cacttaaata tgaacttgtt 60180 ccagctggag cgctgaggac attttgcatg aagccacata tagcagtcta taatatttat 60240 gaacacaaac tcatccagta acagatgcgg atatttcatg gttctataaa ataaattccc 60300 tgtacaaaga aattcctgga actttgtttt attggcattg agaaaagagt aaggacgtta 60360 aatgggatga agggaagatt gttttacatt atttttaata gctctctttt tttttttttt 60420 gttgagacgg agtctcgctc tgtcgcccag gccggactgc ggactgcagt ggcgcaatct 60480 cggctcactg caagctccgc ttcccgggtt cacgccattc tcctgcctca gcctcccgag 60540 tagctgggac tacaggcgcc cgccactgcg cccggctaat tttttgtatt tttagtagag 60600 acggggtttc accttgttag ccaggatggt ctcgatctcc tgacctcatc cacccgcctc 60660 ggcctcccaa agtgctggga ttacaggcgt gagccaccgc gcccggcctt aatagctctc 60720 tttaagtgta aatcaatgag gttgattgta tgatcattca aagacatttt aaaagtcacc 60780 ttgtattatt tttattttta actgtattct caggaattgg cactgttctt aaaattaaca 60840 ggattttaaa aatttcccta aaggatgtta acatttgttt ttctcaataa gtttatatgt 60900 atttaatgtg aaattcaata tccttcttct ggtacagtat gtaaacatgg atgaaaataa 60960 tattaatggc atgttttaaa acaccaattt tcacagtccc ctagagcagg aacagggaga 61020 gtgtcattca ccttttcatt gactgaatcc cttgcatttg tacaatgctt tcatatcaca 61080 agttctttat ataaagctca tttaaaaatg attattgctt ttattttgtg tagattattg 61140 cttttaatct acacaaaact ctatgaggta gatattattc ccattttaca aatgaggaaa 61200 ctgagactca cagaggttaa actacttgtt caaggttaag tctcaaacct tggccttccc 61260 tttctaaaat gcataaaagg ctagcgttag ttctgtgcca ttgcataggg gatcttctgg 61320 gccaggtaca tatcagtggt tttatgacca cacacccaac agtactttct gtgattggtg 61380 aagtctctat gcttatttgt tttaggcttg ttaactgaaa ttcagtgtaa atctaagcga 61440 ctgagaaaaa atgtgaagca gcatgcagat cagaccgtga atgaagacag tgaaggtcgt 61500 gacttgcgac aagtgacctc gacaacaaag tcatgcaggg taataatatg caatggtgtc 61560 tgccaagact ggcaggaact gagtttctag gtacatagtg agctggccag gaggctttca 61620 aattaaagcc acaggcacag ctgaatttct agtccaattg ttcattaaaa tggattccta 61680 atacatgctt tgatgtaaac attcaaatag tagcaatgca taaactaaaa agtgaaagtt 61740 ttccttctct ctgactccat tccagtccta ctacctagtg tttgccagca gaaatgattt 61800 ggtgtataaa atggattttt tccaactgag tattgatgac cattttttcc ctctttttga 61860 gaagacagtc tcaaactggg aagaacatgg tctcaataga ggctgtggat taatccttat 61920 aaatgagtaa aacagggtga atacacgtat gtggatgaat agagaaaagt cctacccact 61980 ggggttacaa atttctcaag aaatccattg agcaacttag gacaaaatag gggaacagca 62040 tccaaggtgt ttagacacac acattgtgat aatgcacagg acatttgatg gaattttatt 62100 ttgccattag ctgaaaaacc agtaagctat gattgtccaa accacaccct tctcaatatt 62160 tattacatta ttgacaaaga aaactttatc cacatttagt tttacaaata cgtatttttg 62220 tatttattta agaaaaagta ttttatatga aagcattatt ttgccatagt gcttcatcag 62280 taacactttg gtataagaaa acaaagagtt gacataatta ttcacactct tcaaaagatg 62340 ctgtagaggg agaaagagat gtttggggtc tgaattgtcc agattgttga ctctctattt 62400 taggggttga caaacttttt ttctttaaag ggctttggcc acatggtatc tacaacagct 62460 actcaactct gccattttag tgcaaaagta gcagagacaa tacataaaca aatgagtgtg 62520 actgtgttcc aataaaactt tataaaatcc aatggatttg gtccatgggc cacagtttgt 62580 tgatccctcc tccacatgaa tactctcaga tgacttcacc aggtggaatt catgtagaat 62640 agtatatgaa tatagtattc atagtagtca tactatatga aaacctttca ttagttataa 62700 ttaaagcttg aactttcagt gaccttcaat atgaatgggc tcaatagtct tcatatccag 62760 actaccattt attcctaaga aatacacaag tggaatattt tgttattcat attttataga 62820 taggaaaatt cagtgaaatg tgatatggcc tgcctgagga cccagagaaa gaaattatac 62880 actctcttag aactcatagt tcctgctgta tttatgcctt ttttttttta tgatttctgc 62940 tattaggtcc ctccagatga atgcacatat agaaagaagg ctttatacac atcttcccta 63000 acacctttta ttttcttgcc agtgtaggag aaaactgaac tcaccccaga tcaacagact 63060 cttctacatt ttattatgga ttcatataac aaacagagga tgcctcagga aataacaaat 63120 aaaattgtat gtataatatc tgaaaatatg tgggtttaaa gttaatattt tctggagttt 63180 ttattgcctt ggaggtataa tttatatgtg aatatgttac tttgaaatga ttagagctga 63240 gaattcaagt tagactttta aactctcagc aacattaaac aattcaagag tatgaagtct 63300 cccacctatc ttgataatgg taattgccct ccaaagatct gagaaatagt aagatgggtt 63360 ttcaaatttt atcatctaaa cctagttttt ctttagtcta atggttttat attatcattt 63420 tcttatctac tttttaatca ttggtttttt tcttaaaatt tagttaaaag aagaattcag 63480 tgcagaagaa aattttctca ttttgacgga aatggcaacc aatcatgtac aggttcttgt 63540 agaattcaca aaaaagctac caggtatttt ttaaataatc aaagttaata tttattgaga 63600 gtttaaatat gtgcccacag attagattac ctattttaca tacggtgttt taattttcaa 63660 aacattcctg tgagatcagc tctattttca ctattacttt gccaagtatt ttcacatgta 63720 cttatttcac tgctattctc tacaatagtc ttgtgacatt gagaaaggca ggtctgttct 63780 ttgtaaaatg aaaatcattt aatatctgat ttaaagtaac tgtcgaacta ctatagacat 63840 aagatattag aactagaaag gatatttaaa ataatgtata ataattcttt caaatcaata 63900 gattaaaaat tgaaactttg caaagttgtc attcacccat ttattccttc aataagcaat 63960 tgttgatacc tactctatac taagtgcctg gctattacct tcagggagag tgatcaaagg 64020 tgataataat aatgataaat acacatacat acacatacac acgaagtgct gtttgaacac 64080 agaggaactg gttgctaatt gtgtctagga aagatggaga gacttcaggg aggatcatat 64140 ttgagttgag tctaaaagaa aaagaagttt tccagttggc atggagaagc agcagcattc 64200 caagttgaga gatctgcttg tttttttgtt tttgttttgt tttgtttgag acggagtctc 64260 gctctgttgc ccaggctgga gtgcaatggc atgatcttgg ctcactgcaa cctctgcttc 64320 ccgagttcaa gcaattctcc tgcctcagtc tcctgagtag ctgggactac aggcacacac 64380 caccatgccc agctaatttt tgtattttta gtagagatgg ggtttcacca tattggacag 64440 gctgaattca aactcctgac ctcaagtgag ctgcccgact tggccaaagt gctgggatta 64500 caggcgcaag ccaccatgtc tggccgagaa atctgcttgt tataaggcac tgaggcatga 64560 acaagaatag gattatttga gataggagag aaaattcagt gtggttatag tatagtgtct 64620 gtggtgggct cctaggttag acaataaggt aagttggtac cagactgatg aaggcctaac 64680 taaggaattt aatctgtgta ccctggacag taaagagatt tcggtggttt ttgtagcaga 64740 gggaaatgaa gtttttgtta tacagaaatg acttaatcag gttcatgtaa gatgaatttg 64800 aatggaaagg gtctgggggc agggaagatc aagtagtatg ctactgcaca agtccaggca 64860 acagggcttg aaataaagca atagcagtgg ggaagagaag aagaaacaaa aagaagagag 64920 attcaaaaat cgatagggct tgaggaacag ttgtatgtgt aaggtgaggg agagggactt 64980 acttatccac actcattctg atggtgccag tatagaaact agaatgtggg agtcttctta 65040 tatcccagcc tcaaattctt tttgccacat tattaatgct atctaagcta atgggtttta 65100 gttgtttcag tgtttattaa aatattggtg tattgaaatc ccgtagtcta gaagagtata 65160 ataaatacct tgtatagcag cataatacta taatattaaa ttatagaaga taacttctga 65220 cataggattt tagttaaaca tgttaatttg ccttcttttt gccagcatta tgctattatt 65280 ttattaaatg tgtcacagat agacataagc ctacataaaa ctcataaagt gtgaggagaa 65340 atttctcact atcattgaaa cactaatatg ctctcatttc tttattaaca atgaaagagg 65400 aacttaattt actatatgct aattaaactg aaggcaagct caataaaaaa ctggattata 65460 taagagtaaa aactgtggag gaaagaactg gaaaaaggag atatttttgg ttatttatgg 65520 gaagagattt tttaagttca aatggtcagt tggttttgaa tcaagaaata attgaagaca 65580 aataaaaacc ttattcataa gaaggtagct tatctgaaag caatagcaat atcttggcat 65640 ctcttaattc ccagtgccta gcctcgttcc tgagggagca agtacaatta gtgtgtgttc 65700 aatgagtaaa gcagaaggaa aaaatttaaa atatggaaag aaataaaagg caaaaaccct 65760 tgagaaggaa aaaaattgac aataatgaca gagatagcat tattacaagt agtaacaata 65820 gggatataat accagtactg aagggatatt ttgtgaaaaa ctgaattgaa tcattccatt 65880 cagtattcat tgagcacaac ctgtgccagg cactgtccaa ggtgctggag tccctgaggg 65940 caataagaca gggccccttt cctcaaagaa gctcttagat tctctcggat ggtggctcaa 66000 tttttagcgt tgctctcttc tctatactcc ctttgctatt tagttcttcc caagccattt 66060 aggccctgat ggtgttatct ctattcatgt gagtgtatgt gccagcatgc acatgagctt 66120 gtgttttgtt tcccattcat tcattcaatc tgtcaattat ccaacaaata attcttaggt 66180 acctattatg tgccaggaac tggggataca aaagtaaaga aaaacaaggc ctctgccccc 66240 aagaaatgta cactccagta gaggtggcag acaattaaca aataagcaaa taaatctata 66300 atatcatgtc gggtggtgat aagttctata aactaaaata aagcaggtag agactagtag 66360 tggtggggag tgtattttag attggggaat gctggagcag aggcttgatt gcaggaagga 66420 gcagggcatg gggcatgggg ttatctggga agaacattaa tctctgaggc aaaaagaggc 66480 ttgtgtgttc acagcagggc caggtggtca gtgtggctgg aacacaaaga gtagagggaa 66540 tgaggagtag aaaatgagtt cagagaagct gttgggactg tactatttaa gcctggataa 66600 caactttgaa ttttccaggg agtgttatat tccatcaggt tggcaattct taccatattt 66660 ttctgatttt cccccaccct gtgtgtcgca gtgctgagtt cagagcagat acatcaacat 66720 gactttggtt ggctgttttt gggaaatact acccgttctt tagcttgact aaagactcat 66780 gagttagcca aatattctgg gtgcatataa agttttcttc tggcttgtag atgcttccaa 66840 tctgcagagt tgaccctgtc acatcaggtg tgactaacat aatggctcaa agaaaatcta 66900 cctgtgtcta gtttctgagg gtttcttcat ctgaatcagt gtatctatgc cataatatat 66960 gtatctactt ggacttgaca tctccctttg cccagcttgt ttaataaatt ctgctcaatg 67020 aagataaaag cctctatttt attggcgagt acaaatggac tcaactagac tacccaattt 67080 taaacaacta cagaatcact tgatgtaaat gttttatcaa tggcaatgat ggtgatcatg 67140 aaatattgtt actccttgat accaatttga ttatcatcat tacctaggat ttcagacttt 67200 ggaccatgaa gaccagattg ctttgctgaa agggtctgcg gttgaagcta tgttccttcg 67260 ttcagctgag attttcaata agaaacttcc gtctgggcat tctgacctat tggaagaaag 67320 aattcgaaat agtggtaagt gatttggcta atggtaaaag agtttgtttc taggagtaaa 67380 attggtgtgc ttcatgaggg tggggctctt gccaatctta tgcaatgtta tatgcctgtt 67440 gtgcctagca tgaagaatgg ctctaaaaag atatgtgtta aattgataag acttctagag 67500 gcttggttca aatatctctg ttgtaataac aaagtagcaa aactacccct ttactgcact 67560 aagatgaaga gccaaaggaa gactttttta aagattggaa aattcatctg attttgaaca 67620 actagcacaa aggagctagg attcttcagc caagaaaaag attatttaaa atcttctggt 67680 atgtaagact aagtatccag ctattatata tctgtatcaa aataaaataa gaagacattt 67740 taatagcaac atttatatgt catttactag gtctcagaca cttttctttc ttttttcttc 67800 ctttcttttt tttttttaga cacagtcttg ctctttgccc aggctggagt gtagtggtgt 67860 gatctcggct cactgcagcc tctgtttccc aggttccagc aattcttgtg cctcagcctc 67920 ccgagtagcc aggattatag gcatgtacta ccacacccag ctaattttgg tatttttagt 67980 agagacagag tttctccatg tttcccgagc tggtctcaaa ctcttggtct caagcgatca 68040 gcctgacttg gccttccaaa gtgctgagat tacaggtgtg agccaccgtg cctgactaaa 68100 gttaactttt tttttctttt cttttttttt ttttgagaca gaatcttgct ctgttgctta 68160 ggctagagtg cagtggcatg ctcttggctc accagtttcc cctgggttca agtgattttt 68220 ttgtgcttca gcttcccaag tagctgggac tacaggcgca tgccatcaca cccagctaat 68280 ttttgtattt ttagtagaga cagggtttca ccatgttggc caggctggtc ttggactcct 68340 ggcctcgagt gatccaccta cctcggcctc ccaaagtgct aggattaaag gcatgagcca 68400 ccacacccag ccggtttcag acacttttct aaatatatga tttcatctaa tcttctcaaa 68460 tattctatga ataaatacta ttatcattcc catttttcaa atggaaaagc tgagacagag 68520 aaaggttaaa taactttccc aatgacagag aggaagtctg acttcagaag ctttgtgtca 68580 gcccaatgca gtggctcacg cctataatcc cagaactttg ggaggcagag gggggtggat 68640 cacttgaaat caggagttca agaccagcct ggccaacata gtgaaactcc gtctctacta 68700 aaaatacaaa aaattagctg ggcatggtag tacgtgcctg taatcccagt tactcaggaa 68760 gctgaggcag gagaattgct tgaacttggg agatggaggt tgcagtgagt caagatcgtg 68820 ccaccacact ccagcctggg caacagagtg agactcagtc tgaaaaaaaa agaagctttg 68880 tgtcttaacc actatgctat agtacctctc aagagacgag ataatagcac aagttactca 68940 gtgtttaact tattgtggaa taacactgtg acagggaagc tcaagagaca ttgagatgaa 69000 ttaccaggaa ttctgtgcaa tttatcttag gagatcctta agataggatg agttcttatt 69060 ttcccgagag atttcatata tgggttggga gtggggaatg aatgagatga cttctcagga 69120 tgttttcttt ctaaagtctt atcattaaca aaagcaaatt taaagacttt taaaaccgat 69180 attttaagat agagattaga gggaaagttt tcaggtagtt ctgcatctgt gatataagtg 69240 aaataatttt ttaaatggaa aattattttt aaaggcagtt tctccatagt cattatgttt 69300 ttttcctagt taaaaacaaa aagcataatt agcatcattg gtagatttaa aaaccacaca 69360 ctttaaaaag atgcattatt cttaatatat taatttacta gtctaacaaa tatttaaggg 69420 caaggtacta tggtagcagt gaagagagaa atcaagcatg ttctttcccc taagaagctt 69480 acagtctaat gggaagagat taactgtcca tgattatgat aaaatatgga agagaaaaac 69540 agagagatga tgactcctag ttgagagagc tccgaaaaag ttttattggc atttgagctg 69600 ggacttgaat gagatttgat ttggatatcc agggtgcaag agagaaggac cagcaggagc 69660 aaagagaaag aggcaagaaa gctcagggaa tgcacacaaa gaagggtgag tagctgagct 69720 gactgaattg agggttacat taaagggaaa tagaaaggaa taaaagaatc agatgattga 69780 gggccttgga tgctgggcta atgattttgt tatttcccaa gtaataggag agatggaaga 69840 tttttgagca gaggagggaa aatgtcaaaa ttaagcttta caaattttct agcagtgtta 69900 gggtaagcca gaacagaagt cacctcttag gaggcaagaa gtctcaaaga tgtcacgctt 69960 gctgccagga gaaatggaga gaagagaaag gatctgggag atatatcttc gtggcttatc 70020 aacttttgaa agacattatg ttgtcaagaa atatggattt tggttctgga cagattgtgt 70080 gtgtgtgtgt gtgtgtgtgt gtgtccaggc taaggggaaa atgttgaatc aggtaacctc 70140 caaggcacca ttcagttcta caagtcttac tgcttttcct ctcttaaata attcaatttt 70200 cctgatgggt ggtttagcta gtcactaggc cagctactga agtatcctga ttttaaagcc 70260 ataatcaata attgacagtt caacttagga aagaattaaa ggacaggctg aattttcaca 70320 tgtatgggta taattgttgt tttagactat tgaacataca ctggtggttc tctctttatg 70380 cagaaaacta caatggcttc aacaggttca gcagcttagc tttgtcattt atttgattct 70440 ggcttccatg gactgtggcg gtgctccctt taagaaaatc ccagttggcc gggtgcggtg 70500 gctcacgcct ctaatcccag cactttggga ggccaaggcg ggtggattac aaggtcagga 70560 gatcgagacc atcctggcta acatggtgaa accccgtctc tactaaaaat acaaacaatt 70620 agtcaggcgt ggtggcgggt gcctgtagtc ccagctactc gggaggctga ggcaggagaa 70680 tggcgtgaac ccaggaggtg gagcttacag tgagccgaga tcacgccact gtactccagc 70740 ctgggggaca gagtgagact cggtctcaca aaaaaaaaaa aaaaaaaaaa tccatcagtc 70800 aaatctaatt cttacaaaac taataaaata aaaaggaatg atgactcatg acaaaactta 70860 ttcattttac tgacaggtct ctactctagg gaatgtgtgt ttggtttatt catcaagaga 70920 cagtggtgat taaggatact gactgtggaa tctggattga atatgggccc cactacttat 70980 tagctacatg tcccattttt ttaaagctgt tgaaatagac ttcataacca atttgatctc 71040 tttgaaatca tgtttaactt taatatttat tctgcagtct tgcctcaagt ttgataataa 71100 aatcttgata agacagggtt ttcttctttt tgagtcagta ttttatccca agcacctagc 71160 acatgcctgg cttggactgg gtgctccaca aaattctgtg gaataacgaa aaaaatgaat 71220 gaaggtttgt ccggagctat aaaactggtc tgtttattta cttttcagat ggttgtttcc 71280 ctgaatatca gagttgttaa tgtccaaagc taacaagtaa ccaacaaaaa gttggttaaa 71340 cattcttcta gacattgcct gtagtagtta atttggggaa cagatatctt ttttgcattt 71400 gagtgtaaga aaaggaaaaa gacagtttgg atatggaagt tctgttgtgt tctctctcct 71460 cctcctcctc aaagatgagt catttaaagt tgattcaggt gccagacaat gaaaaagagg 71520 ggtgcaatgt ctgccatatg aattgaaatg ttttgatgag agggcatctg caggagaatt 71580 atctggtggt tgttctatct ttctttctct gtctcttttt ctctcctgga tgctcagctt 71640 cctttaaaaa aaaatctttg cctggctcca tcagatctcc tgggtcctga gccaagtcag 71700 atgctgccat ttccatgatg accactgggt tagtaccaga ttccttcccc agcctggctg 71760 aggtggggaa aatcccttcc actgtggttc agctaaaagg ggaaaaatag tcttccaaaa 71820 aagtttacaa ggctactaag aggagaatta tgtttataga aaggcatata taaggctgag 71880 catgctgact cacacctaca atcccagcac tttgggaggc caaggcagga agattgcttg 71940 agcccaggaa tttgagacca gcctgagcca cagagcgaga ctccatcttt acaaaaaagt 72000 taaaaaaatt agctgggcat ggtggcatga gtctgtaatc tcagctagtt gagaggctga 72060 ggtgggaggg tctctagagc ccaggaggtg gaggttgcag tgagctgaga ttgtgccact 72120 gtactccagc ctgggtgaca gagtgagacc ttgtctcaaa aaaaaaaaaa aaaaaaaaaa 72180 aaaaaccaac cggggggcat atatgcgtaa tgtctggatc tattaattac ggcaccagaa 72240 agataattat tcagcaccta ctgtgtgtca ggctgtatta tcagtgtttg tgtgagcttt 72300 gggtcctaca agaagcagat atcaacatgg gactaaatat gtcagaagtt aattggggga 72360 aatgcctctg aagaataaag ggggaggaag caggtggagt ccaggagact gtacactact 72420 atgctgatct gacacctgtg aaagtggagc aggaaggcat gattgggcat gaagagcatc 72480 agatcacagc acagtcctga gaaagtctca gctaggccat tggagaggga cctagtccga 72540 gctgcccact gaatggacaa gcactaacac acccactatg ctcattcacc agttgagaca 72600 agactggtga aagtgtgtcc tcagcactca cagggcggca gctggaggct atcagccagc 72660 cagactccct actgtgggct gtctttaagg gtgatctgag tgggtcatct ccatggctaa 72720 cattttgctt cacgaaaaat attgtttgct ctttgtagca ctttgaggta gatattatca 72780 attctatgcc agacctagac cctgttaatc ccaataacaa aatatcagtc ccaagtatct 72840 gttccctctg atattactct ctaactttcc agcttgccac cttccagtac cctgactcta 72900 gggattcttt gttccaaatc aacgtccatt gccgggcgcg gtggctcatg cttgtaatcc 72960 cagcactttg ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcttgg 73020 ccaacatagt gaaaccttgt ctctattaaa aatataaaaa attagcctgg cgtggtggca 73080 ggcacctgta atcctagcta ctcaggaggc tgaggcagga gaattgcctg aacccaggag 73140 gtggaggcag cagtaagcca agatcgtgcc actgcactcc agcctgggtg acagagtgag 73200 actctgtctc aaaaaacaaa caaacaaaca aacaaacaaa atcaaaaatc catttgttga 73260 ccctccatcc actaaccctc cagccattga ctctacttcc ttttgcctga ccacactttc 73320 ctcagtactt ggcttagagt tcgtggttca tcaatatgat catttcctta gatcattcac 73380 cagatatcct caactctttg tctttctttc acctaagaaa accctctgcc tgactgagta 73440 cgactcttaa cctactctgt gcttgagcct gagaactgaa acagtatgac tggcttaccc 73500 aaagttagtg atcaaacact tcaggtggac ctcagtactt tccatgtatc ccttttctct 73560 agccaattca ctcttatact tatggaaaat tgtcttgtac cttctctttc ttcaaatttc 73620 caacatctct tcccactcat tctcaactat aactttgttg agaaaaggtg tctttcaaga 73680 acatgccata ggagttggag ctcaaggata tgtggtcaaa ccccagcttc aaggaaatgt 73740 aatcaagctt agagaaagat gaactccccg annnnnnnnn nnnnnnnnnn nnnnnnnnnn 73800 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 73860 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 73920 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 73980 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74040 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nctttttatt atattaagca ctttataaac 74100 acttgttttt tgcctgaaga atacttatga gctgagtgcg gtggctcaca cctgtaatcc 74160 cagccctttg ggaggccgag gcgggtggat catgaggtca ggagattgag accatcctgg 74220 ctaacatggt gaaaccccgt ctctactaaa aatataaaaa attagccggg catggtggca 74280 cgcacctgta gtcccagcta gtcaggaggc tgaggctgga gaattgcttg aacccggaag 74340 gcggaggttt cagtgagcca agattgcacc actgcactcc agcctgggca acagagcaag 74400 actccatctc aattaaaaaa aaaaaaaaaa gaatacctct gatgatcagt tgtagggtag 74460 agatgatttg catcccctaa accttcagaa caggcaaggc tttctgactc atgtcatctc 74520 tcatcagaat atctcctcat atttctcagg gtggtggctc attagcctcc ctcctctcac 74580 ttgatgagac acttagcttc gaatcttatc attttcattt gactctgcct gtagggcaac 74640 tatccggtag caatggaaga tcatttccat taggacctgt atttgtttat ttgtttactt 74700 ttttttaaca agcttttatt gagtacttct atgggatagg tacactggcg gtacaaagat 74760 gaataagata aagtcataat cctcaaggac ttcatcttct aggggaaata tatttgaaaa 74820 gaattaatca caatagattt catgcataat ttaagttaga ggtttggctg ctgtaacaga 74880 gacaaaaaat aaagagtgct taaacaagat ggaaatgtat ttctatctta tactagtctg 74940 ggcatgagga gaatgggact gatagggaat cttcatagtg tccctctgtg tcatagttct 75000 atccatacag atggatgtca tgctcatcca tccacagaga gttgacttca tcttcttggt 75060 ccagtatggc tccccaccat tttggtaaaa acacattcca ggccgggcac agtggctcac 75120 acctgtaatc ccagcacttc gggaggccaa cgtgggcaga tcacgaggtc acaagattga 75180 gaccatcctg gctaacacag tgaaacccca tccctactaa aaatacaaaa aattagccgg 75240 gcatggtggc atgtgccggt agtcccagct actctgaagg ctgaggcagg agaatccctg 75300 gaacctggga ggcagaggtt gtaatgagca cagatagtgc cactgcactc cagcctgggc 75360 aacagagtga gactccatct caaaaacaag caaacaaaca aaaacacatt ccagcaagta 75420 gggagggcaa aggggaaaaa gaatggcaca ccttccctgt aagtgaataa tcctgaagtg 75480 ttacacataa cttctactgg cattccattg gcaagaactt aacttcttca tcacaccttg 75540 agacaagaga cctgtaaaga gaactactaa tgcaataact gggtgagaaa tgatgagggt 75600 cttgctggtt agaatgcata taatcaaaaa gaaaaaaatc ttccatgtaa tttcataagt 75660 ttcagaactt gtgatacctc aagtttcaga aaagctaaag aatgaagttt tccagaatat 75720 gatctgtata aataaatatg catgtttcta gtttataata agatactttc tccaaatatc 75780 aaccatatat tgtgacaata aaaaccatcc aaactcacat gctgctgcat attttatcta 75840 taatgttttc atttattaat atttatgaat ttccttctcc cagtgggttt tcagaacatt 75900 gcctcttcca cacaacacat aaaaacctga tctcgcaaga tgacctttca gactaacttt 75960 ttattcttat gctaatacaa tcattcttat ttctattata catttaattc cagagtaaca 76020 taatgctcaa agttactctg tttatgccta tgcaactctt taaagaatat gatgtgaaat 76080 ctcttttctt gttcttcaaa ttcatactct tttgtttgct gttattaaat cccaatcaca 76140 tacacaaaga taactgtctc tgagatgctt ttaatgagaa ttagagtatt tatgttaatt 76200 gaggctttag aacgtatatc tgatataatt tgtgcttgag ttgtttctat acaattctag 76260 tgcatcattg attttattcc taatgtacca ttacttacaa actaggggtt taagttggaa 76320 ttgttaataa aaataggaat aatcatagta tataaaatat tatagactga aataggtttg 76380 ctaacactga atgagtttat attttgtttt gttttgattt tttttgagac aaagtttcat 76440 tcttatcgtc caggctagag tgcaatggca cgatctgggc ttactgcaac ctccgcctcc 76500 tgggttcagg caattctcca gcctcagcct cctgagaagc tggggttaca ggcgacaggc 76560 aacatgtccg gttaattttt tgtattttta gtagagatgg agtttcacca tgttggccag 76620 gctggtctta aactcctaag ctcaggtgat ctttccgcct cagcctccca aagtgctggg 76680 attacaggtg tgagccaccg cgcctggccc aagtttacat tttaacatta aggaaattag 76740 caataacctc tcagagatca aagcaacttg ttctgtccat ttcaaatata aataatccct 76800 gacatttata gtagttttgg gcctttttcc atccccacaa gggacataaa taaaatattc 76860 aattcagtaa gaatttacca gtatgacaat atatctcatg ttttgcaatg ttttgttttc 76920 acatgcatta tcctgttagt gctgctccca atggccctca ctaggtgagt gttatttata 76980 aaatatccta ttttatagat gtagaaatgg gcttagagag gcaagatgac ctgctcaagt 77040 tcatacagct aatcaaaaga gtgcgaggaa attaaggtgg gtattctaac tcagagccca 77100 gcactctttc tatatgtggc atctgtcact caccaatttc taatatttgc ccaactatct 77160 gctagcaggg aatataaagt aaaacatggt ttctgccctt cagttgagag ataaagtata 77220 agcagttgtt agcaatttca aattgcttta agtcagtgat caaatgaata gacaatgaag 77280 tcagagaaag gagataatct tgttagggtg atgaagttgt agcccttaat agtaccatac 77340 taaaatatac tgtgcttatt tttgttatat gatttcactg ctttagcctt ttcagcttgc 77400 acaaaatgaa aacagttacc agaactatgt tttccaaaga tgcaaatcta agctgattat 77460 cagttcagat actgtctttg agaggggtta gaaatgtaaa gaaacaaatg atagactatt 77520 tataagaagg gtaaatgctt tagttaaatt tagaaatctt aacacctcta cttttgcttt 77580 tgaaagtggt cacatgctaa cttttgcatg ggcaaattta ggactcaaat tataggaaga 77640 aaccttcctt ggattaagag gaaacattaa agaaatacaa gtgaggttat ttccattgcc 77700 cctaggcaga atccattgtt cctcagctgt atttcccaag ctttttgtac ttgctcggtt 77760 ttagatatta tgtatttatt tatgtcaagt cgccccttgg tatccacaga ggattggttc 77820 taagatctct cttggatacc agttaatggc tgctcaagtc cctgatataa aatacaacct 77880 ttgtatataa cctatgtaca tcctgttgta tactctaaat cacctccaga tgacttataa 77940 tacctaatac aatgtaagtg gtatgtaaat aattgcttca ctgtattgtt tagggaataa 78000 tgacaagaaa aaagtctgta cacgttcatt acagttgcaa tacttttccc aaatattttt 78060 gatctgtggt tggttgaatc catggataca gaacccacag agagttaact gtatctgctt 78120 cttccattaa cctgttaatt gcctgaggac ttcatattcc cagattccca gcactaatat 78180 agtgcctgac acatagcagg aatacaaata ttggtgaatt ttgttgagtt gaatctggaa 78240 cttgacagtc caaactctgg taactcctaa gccctggtga gacctaatta ggccggatag 78300 tttttagcca ctaggactaa tgggaagcat atcatgtatt atatatttta aattaaatat 78360 tttatactta aagtagttct tttttgaatt gctagtttga tggcagctgt ttacttagga 78420 ccaagagtat ggcacaaatg actccctggg gaaagtttag cagttgaaat agaggacata 78480 gtccaatctc ttagcaccac aagaatatgt ccattgcctt aatggcatcc cttgtcccca 78540 tcacatctca caactatgtc accttcatca taaagaacta cgaaaacata ctactgcctg 78600 atttaacatg catttattaa tctcctatta tgtcttcaga aggatgctag ttattagaga 78660 tgcagaggtg aataagaaat atctggccag tcacagtggc tcactcccat aatcccaaca 78720 tttaaggagg ccaaggtgga aggattgctt aagaccagga gttcgagacc ggcctagaat 78780 acatagcaag tccctgtctc tacaaaaaat aaaatgtctg gatgtggtgg tgcatacctg 78840 tagtcctagc cactcaagag actgaggcag gagaactgct tgagccccca ggaggccaag 78900 gctgcagtgg gttatgattg tgccactgca ctccagcctg ggtgacatag ccagattctg 78960 tcataaaaaa aaaaaaatct atggagcacc ctttatgggt caggcagcat ggcagccact 79020 ggagataaca gtaactgatt cccactcttc tttttttttt gagacggagt ctcgctctgt 79080 cacccaggct agagtgcagt gcagcgatct tagctcgctg caacctccgt ctctcgggtt 79140 caagcaattc tcctgcctca gcctcctgag tagctgggac tacaggcgcg tgccaccaca 79200 cctagctaat ttgttgtatt tttagtagag atggggtttc accgtgttag ccaggatggt 79260 cttgatctct tgaccttgtg atccgcccgc cttggcctcc caaagtgctg ggattacagg 79320 tgtgagtcac tgcgcccggc cgattcccac tcttataaaa tgtatatggt atttcacact 79380 ccagtgagaa ggagagatat gaattgtgat aagttgcaat agaactgaac ctagaaaagg 79440 aagtaactaa ttctgcttct ctgtgacagg gaaagctcca tagaggaagc aaaatttgaa 79500 cttggccttg aaccatggat ggggtgcctt agtctaacag ggatcctgcc ttgtatttca 79560 ggaaaaagca tgaagacata aaaagtgtat cccacttgtt aggggagctg tcagaatgct 79620 cttggccgat agctacagaa aataaactca ttatggccta aaaaagtaag aaaatgtatg 79680 agctcaaaca agaaacccag aggaagggtg acagcagaat tggttaattt aggggcagaa 79740 cccaccatcg gaaacacaga tccatctcat ctttctgctt tgctatcctt ggcgtattga 79800 ccttccccac aggctggttc tcctcctggt catctaatgg ccacagcagt gctaggcact 79860 atgtctgggc acaacactgg tcaaaggaaa aagaaatgct gtcttcctca gtatggaaat 79920 ctttcttagg aacttcccag tagactcccc tttagtctca tgggcccaaa tttcactgct 79980 gtccctaact acaccaacct ctgatgtggg aaatgagacc aacagtggct ggaccttggg 80040 ctggagctgg gatttgcctc taccatagtg tatggccatg gggcagggag ggggaaggga 80100 gcctgtgcag gcaacaagct gaatcagctt taggagcctt agatgaatcg gtgttattag 80160 ttcctaggga aactattggg ctgctgagag attagtttga aaggaaagca gaaggcagat 80220 gtggctttgt gtgccgtgtc agggatttat accttgacct ctgtgctatg gagtgatgtc 80280 atttctagaa aagtttcaag ttacatggtt ttaaggaaat agtatcattt ttaaccagat 80340 gctcagtcca tcttgaggac gtcatcagaa taagtcatat agaggtattt aagatactag 80400 gatatatgtc tagtgtaata tgcatacttg taaataacac acacacacac acacacacac 80460 acacacacct gcttcatttt tgttcccaat tacgactaga aagctaatta ggaaataaaa 80520 acttttaaca tttaaaacac ctaaagcagt atagtgttaa ttctatcaaa tggttattct 80580 ctcaaagaat atttctgatc attgaacata atttattaat cacagacctc tttttaaaaa 80640 agggaaaaca aataggaaac aggagcagaa acataacatt atttttgcat tttcagactt 80700 ttccatggca aagtctttgg tagttaagtt agaaaatagc ccccccttgt gaccaaaaat 80760 caggtaattg gtcattcaag cgaaaatgaa gcaaatttta agcaattctc caggaagaag 80820 gaagcaggaa aacaggaaaa ggctgtcccc ttcaggatga tgttagtgtt acatctccag 80880 tgttagaaat tgctggttca caagcccttt gtttatgttg gcatggttaa ctctgattgt 80940 tgactctaat tccaaaagta gatttatatt tatctatctt gtaggaaaat tattttttaa 81000 ctctcccatt atttcttttt tacattttct ctatattcac tcttggcttt tacaacatgc 81060 acaagatgaa ataatcattt taaaaacttt aaccttattt tccccaaatg ttatcaattg 81120 taattagata gctaatggag gcaacatctc tggagggtgt gtggggattt agaaaggcaa 81180 atgaaatagg aaatttggta atcaaagaaa tttctaaaaa ttttaatatt ttcaaatcag 81240 ctgtcttgaa tatccctaaa tcttcatgta ctatttaatt taatggcata gccataaatt 81300 attattttag gtaaccacaa ctgaggagtg tagtctctga atgctgaaga aagcataaag 81360 attgacattt ttttgaagat gaaaaagaac agccttacca agaaaataga gaacttagtt 81420 tagctttaat acctggaaaa atactgaagc aaatcactga tcagccagtt tgtatttatc 81480 cccagtaatt tgcagaattt gttggttttt taaaacctac atggctttgt aaagacaaaa 81540 gttcacacgt ttaatctaag acttaagatg aactgaaatt aagatggact aatgtgttga 81600 ttcagttctc ctgaataatg ataataatag ctactattat aattatctac tctgacattt 81660 attgaggatt tactatgaac ctggaattgt gctaagcact ctctatatat ctcatataac 81720 actactgtta tatggagtac acattatatt acacgtgaca acatttcaca gatgagaaaa 81780 tgaagtctta aaaaatttaa gtaatgtccc aagaccaagt cgtggagttt ggtttcaaac 81840 tcagagctag cttatgacct cttaaccact aggtttgctg gaagccagga ctggtattct 81900 ccaggctcac attctccaat cctctgcact ttaagttgta tctagttggg tcatgggtat 81960 ttagttcatt ctgtaacagc tggataaagg gcaatcttcc acatgatgta atccccaaat 82020 tgctcaacaa agacattgtt gttaatagag tccactgggg tcaattaaaa aatcaaagtc 82080 aaaggggagc agagggtggt attttgacca gagaggaaaa cggcaaggat ttaatttggg 82140 aagactgttt tttaaaaaat atttaaacat ttattgtgct caaaatttaa atccaaaaaa 82200 attaaataaa tcgtataatt ctctttatag gctgggtgcg gtggctcatg tctgtaatcc 82260 cagcactttg agaggctgag gcgggtggat cacctgaggt cgggagttcg agaccagcct 82320 gaccaacatg gagaaacccc gtatctacta aaaatataaa attagctggg catggtggtg 82380 catgcctgta atcccagcta ctcgggagac tgaggcaaaa gaattgcttg aacctgggag 82440 gcggagtttg cggtgagcct agaccgtgcc attgctctcc agcctgggca acaagagtga 82500 aactccgtct caaagaaaaa aaaaacaaaa aacaaaaaac aactctcttt taaagtacaa 82560 aataattcat caccttcctt tattatttac ccattatttc atctaaatgt ttgtagctaa 82620 gattttctta aagcaggtta cccatgatta cagctgcctg agaggcctgt ctcgtggcat 82680 gtatatatgt ttcaccaaag gcaatttggg aatgtgtgtt ggatcaagat tcagataatg 82740 tagacttggg ttttccattt attatgccag taaacttagt caactcagat tacccattct 82800 gagccttact ttctcaagtc tgaaatgggg atagtaatac tggcttatca gtacttcctg 82860 cacaggattg ttgtacagaa tcaaattcgt ttatatattt gaaaaatatc tacggaaggg 82920 atgggtgtaa ttattgaaaa gaagaaatcc tagcatactt cgggatcttt tactaccatt 82980 tgcatcactg taagatagag aaaaatgaga gctgtgtcca tgcatgttag aagtaatttt 83040 tttccttttt gtattcattt tattttaatt ttttttacat tttattttag attcaggagg 83100 tacatgtgca ggtttgttac atggtgcgtt agtctgtttt gcattgctat aaaggaatac 83160 ctgagactgg gtaatttata aaggaaagag gtttatttgg ttcatggttc tgcaggctgt 83220 gcaagcatga caccagcatc tgctcagctt ctggggaagc cccagggagc ttttactcat 83280 ggtggaaggt gaagggggag caggcatgtc acatggtgag acagggagaa agagagatgc 83340 taggctcttt taaacattca tatttcacag taactaataa agtaagaact cattcattac 83400 catggggagg gcaccaagcc attcatgagg ggtctacccc aatgacccaa acaccttcca 83460 ccaggcccca tctccaacac tgaggattac atttcaacat gagatttaga ggcaaaaaac 83520 atccaaacta tttcacatgg atatattgtg taatgatgtg atttggactt ctggtgaacg 83580 catccctcat ccaaatagtg aacattgtac ccaataggta atctttcaac cctcaccttc 83640 atcccaccct cctgcctttt ggagttgcca gtgtctatta tttccatttg tatgtccatg 83700 tgtatccatt gttcagtgag gacatgcggt atttgatttt ctgcttctga gttatttcac 83760 ttatgatact cttctaacag tgagaattta ggctttctgg taagtatacc ttagatgata 83820 attgtactct ttagcgtctg gaatcaaagt ttattctggt ttgctcagcc aaagctcttt 83880 aactctggat ctggtttgga agaaatggag cagtctattc tcatctggac caatcccagg 83940 aaacatcctc atctcaatta atccataagt cacatcacag gtttttgtgt tggttttgtt 84000 ttgtttttct gtcaccacga ttctccatcc aatggctctt gtggaataaa gcaatgcaca 84060 gaagtggcac taagaagtct gttttttttg tttttgtttt tgtttttaga gatagcatct 84120 tggcccgggc gtggtggctc acgcctataa tcccagcact ttgggaggct gaggtgggtg 84180 gatcacaagg tcagggtttc gagatcagct tggccaatat agtgaaaccc tgtctctact 84240 aaaaatccaa aaaaaaaaaa aaaaaatcag ccaggtgtgg tggcacacac ctgtagtccc 84300 agctacttgg gaggccaagg gacggaggtt gcagtgagcc aagatcacgc cactgcactc 84360 cagcctgggc aacaagagtg agattccatc tcaaaaaaaa aaaaaaaaaa aaaaaaaaag 84420 agacagcatc tcactttgtc acacagactg gagtgcagta gcacaatcac agcttactgc 84480 agcctcaacc tccaggctta agcgatcctc ccacctcagc ctcctgagta gctgggacca 84540 caggcatgtg cccactgcac ccagctaact tttgtagttt ttgtagagat ggggttttgc 84600 catgttgccc aggctagtct caaactcctg agctcaagca atcttccagc cttggcctcc 84660 ttaagtgttg ggattatagg cgtaagccac tgcacctggt tagaagtctt aaatgacaaa 84720 gaggagtgag ctagataaaa gagaaaaggg atatttcaag tagagagcac aacatgtaca 84780 aaaactgggc aatgtgaatt gtttagggaa ctacaagtgg ccgaggtggc tagaggaaag 84840 gcagtgtgat agagtgatag agtgcagctt agccgtagcc actgtcagga ctttggagcc 84900 tgccagcttt ggtttgaaac ctagttccac ctctctcagc cgagtgcctt tgggctagtt 84960 atataacctc tctgtatcta cctcacgggg tttagaacta ttaaatgaat tatggtatat 85020 taaatgctta aaatattgca taatacataa tgagtactcc tactattact attgttattg 85080 tgaatagtta gataggcagg ggacaggtcc tagaggctcc tacgttaaaa ggaagaaaat 85140 taaaactata tgcctaaagc catggggagg tttagaaggt ttttgtaaga agtagagaga 85200 aagtatgtga tctgattttt agaaaggtca ttccaatgag aggtgacagc gtgctggcag 85260 tcctcagagc cctcgcttgc tctcggcacc tcctctgcct gggctcccac tttggcagca 85320 cttgaggagc ccttcagccc accactgcac tgtgggagcc cctttcaggg ctggccaagg 85380 ctggagccca ctcccttagc ttgcagggag gtgtggaggg agagacgcga gcgggaaccg 85440 gggctgcgtg cagcgcttgc aggccagctg gaattccggg tgggcgtggg cttggcgggc 85500 ccctcactcg gagcagccgg ccagccctgc tggccccggg taataaggga cttagcaccc 85560 gggccagtgg ctgcggaggg tgtactgggt cccccagcag tgccagccca ccggcgctgc 85620 gctggatttc tcaccgagcc ttagctgcct tcccgagggg cagggctcgg gacctgcagc 85680 ccgccatgcc tgagcctccc acccactcca tgggctcctg tgcggcccaa gcctccccaa 85740 cgagcaccac cccctgctcc acggcgccca gtcccatcga ccacccaagg gctgaggaat 85800 gcgagcgcag ggcgcatgac tggcagacag ctccacctgc agccctggtg ccggatccac 85860 taggtgaagc cagctgggct cctgagtctg gtggggacgt ggagagtctt tatatctagc 85920 tcagggatta taaacacacc aatcagcacc ctgtgtctag ctcaaggttt gtgagtgcac 85980 caatcgacac tgtatctagc tgctctggtg gggccttgga gaacctttat gcctagctca 86040 gggattgtaa atacaccaat cagcaccctg tgtttagctc aaggtttgtg aatgcaccaa 86100 tcgacactct gtatctagct gctctggtgg ggccttggag aacctgtgtg tggaaactct 86160 gtatctaact aatctgatgg ggacgtggag aacctttgta tctagctcag ggattgtaaa 86220 tgtaccaatc agcgccctga caaaacaggc cactgggctc taccaatcag caggatgtgg 86280 gtggggccag ataagagaat aaaagcaggc tgcccgagcc cgcattggca acccactcgg 86340 gtccccttcc acactgtgga ggctttgttc tttcgctctt tgcaataact cttgctactg 86400 ctcactcttt gggtccacgc tgcttttatg agttgtaaca ctcactgcga agatctgcag 86460 cttcactcct gaacccagcg agaccacgag cccaccagga ggaataaaca actccagacg 86520 tgctgcctta agagctgtaa cactcaccgc aaaggtctgc agcttcactc ctgagccagc 86580 gagaccatga acccaccaga aggaagaaac tccgaacaca tctgaacatc agaagggaca 86640 gactccagat gtgccacctt aagagctgta acaccgcgag ggtccgcggc ttcattcttg 86700 aagtcagtga gaccaaggac ccaccaattc cggacacacc aatacgtgga tagggataga 86760 tcaaagggaa tcaaactaaa gataggaagg ccagtaaaga ggcaactgca gtaatccaga 86820 ggaagagaga tggaagctta aattaagtaa atgccaggtg caataaagag gagagggagc 86880 accaggtgag agtgaggagg agaagccaca ggcctgggtg cctgaatctc agttgtcagg 86940 cctgggcagt tgagtggtgt ggataccact caccgagaat gggaacgcca ggaaagaagt 87000 ggatttggag aaacgtgggg gagcttagtt ctgtggtgtc tttgattttg gacatactgg 87060 gtttgaagag aacagccaat tactgatgct tagtaggtaa tggcttggtg cttcaggaag 87120 agatcagaac cctttgataa gaaccctggc ctggctgggc gcggtggctc acgcctgtaa 87180 tcccagcact ttgagaggcc gaggcgggcg gatcacgagg tcaggagatc aagaccatcc 87240 tggctaacac ggtgaaaccc agtctctact aaaaaatacc aaaaaattag ccgggcgcgg 87300 tggcgggagc ctgtagtccc agctactggg gaagctgagg caggagaatg gcgggaaccc 87360 gggaggcgga gcttgctgta agccgagatc acgccactgc actccagcct gggtgacaga 87420 gcgagactcc atctcaaaaa aaataaaata aataaataaa taaagaaccc tggcccattt 87480 agaaatgaag ctgtctatac ttccagactg aataacaacc actttttgtt agttgtttaa 87540 gatgtgatct ctgccttaat tatcgatagt catattcaaa agattcacta aatcccatct 87600 ggaaaatgga gacgtttttg agattctttt ggcaattcaa gagcattatc attgattatt 87660 agtagtgact gaaacaaaga cactgacaaa attagacata cactcattac actcattgtt 87720 ttggctatat tttgagaatt ttgtcaactg attgtcttca caatgtcctt cctttggaca 87780 tataataaaa ataatatagt cattatttga atgtgtgcat atagttgcaa aattaccatg 87840 tgtttatatg tataatgtgt ttctagtttt acactgttta gtcactcaaa aattgtatta 87900 tgattgcaac tttcccccac aggtatctct gatgaatata taacacctat gtttagtttt 87960 tataaaagta ttggggaact gaaaatgact caagaggagt atgctctgct tacagcaatt 88020 gttatcctgt ctccaggtaa ttccaatgtt acattttaat ttttatgcca tttttttcag 88080 tatactagta ataacgttta ttgaaaaaaa tctggaaaac atagaaatat aaagaaaaag 88140 taaagtagcc tataattctg ctatccaaat agaaccatta ttaacctttc agtgcacttc 88200 ttttcagtct ttttaaaaaa tatcacactt attcatttaa taaatgtata ttgaacacct 88260 actatgtact aaccatgatg ctaagcaccg gggatgcaac aagacacagc tcccatgcag 88320 cttgcattct ggtaggatga gaaagataat aaatgaagaa acaatatttt ttcagaaggt 88380 ggtaaatgct acaaaaaata taatgagaat ataaagaaag agagtggcac aaaaatggta 88440 ttttaatttg tggtcacagg aggctttttt taagtgggtg gcattggagc agggatctaa 88500 atgaaattac atacttttta acataataag gaactttttt ctgagtttat atttcttcta 88560 aaagtcattt cccaagtcat taaaaaatta cattggtaaa gttgtaatat gtgctacata 88620 ttccattgta tgaatataca atcatttact taatctttac cacatgtttg tatattcagg 88680 atatttccag tttttttgat gttgaaaatg tttcttcaac agtttttttt ggttgttgaa 88740 atgatacctc tttccatatt tttttctata ttttttggtt actttaataa tttgggtgtc 88800 gaatgatgcc atgctcatga tttgtagagc ataataacaa tgttagcctt aggccatttt 88860 tatacattat cttgaggcag aagctagttg ttactttgct ttaccataaa aattatgact 88920 tgtgactaaa aaactattca gcagaacctt ctaaagttat gtatggatat gtcctatgac 88980 ccagtgattt cacttctgag aatatactca acagaaagga tgctcacgtc caccctacag 89040 gttatcttta ggggttttga ttggaagggg aattgggagg cttctgggaa gcattacatg 89100 tgttgtatct tgatctcgtt ataattacgt gtgtgtgcat aaaaattcat caggttgaac 89160 acttaatttt cacattttgt gtacaacatt taacttacac ttcaaaatag ttaacgttgc 89220 tataattatg ctgaattaat gcttttccac tttttaattt tgtcatttta ttttaaactt 89280 caaaacagat agacaataca taaaggatag agaggcagta gagaagcttc aggagccact 89340 tcttgatgtg ctacaaaagt tgtgtaagat tcaccagcct gaaaatcctc aacactttgc 89400 ctgtctcctg ggtcgcctga ctgaattacg gacattcaat catcaccacg ctgagatgct 89460 gatgtcatgg agagtaaacg accacaagtt taccccactt ctctgtgaaa tctgggacgt 89520 gcagtgatgg ggattacagg ggaggggtct agctcctttt tctctctcat attaatctga 89580 tgtataactt tcctttattt cacttgtacc cagtttcact caagaaatct tgatgaatat 89640 ttatgttgta attacatgtg taacttccac aactgtaaat attgggctag atagaacaac 89700 tttctctaca ttgtgtttta aaaggctcca gggaatcctg cattctaatt ggcaagccct 89760 gtttgcctaa ttaaattgat tgttacttca attctatctg ttgaactagg gaaaatctca 89820 ttttgctcat cttaccatat tgcatatatt ttattaaaga gttgtattca atcttggcaa 89880 taaagcaaac ataatggcaa caggattttc tttgggaaca aaatttgata aatatacaaa 89940 ctcatttctt taaatgttga aagacatcct attgaatctg agggggtaat ttacagaaag 90000 atactgtctc tcgcactttt gtcatctctg gaaaagtcct ggtgaatagt cattcatttg 90060 taatagccct ctgatcattt ccttttcttt gaagttcaaa gagaatagca aacagaaaag 90120 ttaacaattt tcacataaaa tattccctca agcaatcctt tcccccatag ttagtcttga 90180 ggcaggattc acaaattcag agtatgcagc taactgcagt gatgggacct tcgctttcct 90240 ctcttaaaag tgaggaaatg attggcttca ttgtatccaa tcatctttct gttttgtttt 90300 gcttttgaaa acattcctac agctgggagc acagctgctg tatacttccc ccaaagcagg 90360 aattagttca tttccttcct tcattgttac actgttgcat tcttctccct attatccttt 90420 ggtgaataca ataaaacacc ttctataagt aatgggcgta tatctccctt tctcctaaag 90480 ttagccaaag tggagcatga ttagcaagca gtttttttgt atcaaaatac taggacaacc 90540 ctggaattct aaaaataacc agtcagtgga aaactgtgtt gaggctagca tcaattggat 90600 tctgatacaa caaaaatgtc ttgagttcct agtaagtaca agattcattt gctaaactcc 90660 atggaggata caaaaatgat taagatccag taacagagct gagcccaatt gacacctgta 90720 atcccagcac tttgggagac tgaggctgga ggattgcttg aacccaggag tttgagacca 90780 gcctgggcaa cataggggaa accctgtctc tacaaaaaat aaaaaagaat aaaatcagcc 90840 aggggctgtg gcttgcacct gtggtggtcc cagcaacttg ggaggctgaa gcaagaggat 90900 tgcttgagcc ccagagatgg aggctgcatt gagtcatgat cgcaccactg cactccagcc 90960 tgggcaacag agcgacaccc tgtcttcaaa aaaagagaaa 91000 11 20 DNA Artificial Sequence Antisense Oligonucleotide 11 cttggattgt tttgggtcag 20 12 20 DNA Artificial Sequence Antisense Oligonucleotide 12 aaacccaggt tggaataata 20 13 20 DNA Artificial Sequence Antisense Oligonucleotide 13 acacacagct catccccttt 20 14 20 DNA Artificial Sequence Antisense Oligonucleotide 14 cgcatgtaca tatccatcac 20 15 20 DNA Artificial Sequence Antisense Oligonucleotide 15 cagccaacat tcccatctct 20 16 20 DNA Artificial Sequence Antisense Oligonucleotide 16 tttacactga atttcagtta 20 17 20 DNA Artificial Sequence Antisense Oligonucleotide 17 gcatgctgct tcacattttt 20 18 20 DNA Artificial Sequence Antisense Oligonucleotide 18 gttcagtttt ctccctgcat 20 19 20 DNA Artificial Sequence Antisense Oligonucleotide 19 atgagaaaat tttcttctgc 20 20 20 DNA Artificial Sequence Antisense Oligonucleotide 20 agcaaagcaa tctggtcttc 20 21 20 DNA Artificial Sequence Antisense Oligonucleotide 21 aagtttctta ttgaaaatct 20 22 20 DNA Artificial Sequence Antisense Oligonucleotide 22 gcctctctat cctttatgta 20 23 20 DNA Artificial Sequence Antisense Oligonucleotide 23 tcagcatctc agcgtggtga 20 24 20 DNA Artificial Sequence Antisense Oligonucleotide 24 gggaaacatc cttggatttc 20 25 20 DNA Artificial Sequence Antisense Oligonucleotide 25 tgtacaagaa ctgcatgctg 20 26 20 DNA Artificial Sequence Antisense Oligonucleotide 26 tagctttgga cattaacaac 20 27 20 DNA Artificial Sequence Antisense Oligonucleotide 27 agattatctc ctttctctga 20 28 20 DNA Artificial Sequence Antisense Oligonucleotide 28 ggcatttact taatttaagc 20 29 20 DNA Artificial Sequence Antisense Oligonucleotide 29 atccaatttc gcattaggat 20 30 20 DNA Artificial Sequence Antisense Oligonucleotide 30 gatcccagcg attttgctac 20 31 20 DNA Artificial Sequence Antisense Oligonucleotide 31 ggattctgga ctgagtcttc 20 32 20 DNA Artificial Sequence Antisense Oligonucleotide 32 caaggccctg ggaggattct 20 33 20 DNA Artificial Sequence Antisense Oligonucleotide 33 gggtcagaga tggactttca 20 34 20 DNA Artificial Sequence Antisense Oligonucleotide 34 cttctacctc cttggattgt 20 35 20 DNA Artificial Sequence Antisense Oligonucleotide 35 ttgaggaaat gtccagaaga 20 36 20 DNA Artificial Sequence Antisense Oligonucleotide 36 atgcactttc tttatggtgg 20 37 20 DNA Artificial Sequence Antisense Oligonucleotide 37 tgaaatgcac tttctttatg 20 38 20 DNA Artificial Sequence Antisense Oligonucleotide 38 tttgatccca tccaaatttt 20 39 20 DNA Artificial Sequence Antisense Oligonucleotide 39 taataggatg acgaggaaat 20 40 20 DNA Artificial Sequence Antisense Oligonucleotide 40 gtagaaaccc aggttggaat 20 41 20 DNA Artificial Sequence Antisense Oligonucleotide 41 ttccaggaga gtaccactct 20 42 20 DNA Artificial Sequence Antisense Oligonucleotide 42 gttcatatat tccaggagag 20 43 20 DNA Artificial Sequence Antisense Oligonucleotide 43 ggcatacgcc tgagttcata 20 44 20 DNA Artificial Sequence Antisense Oligonucleotide 44 gctacctcag tttctccctg 20 45 20 DNA Artificial Sequence Antisense Oligonucleotide 45 ttgatcctcc ctgctgacgc 20 46 20 DNA Artificial Sequence Antisense Oligonucleotide 46 ccacaaacaa cacacagctc 20 47 20 DNA Artificial Sequence Antisense Oligonucleotide 47 ctctgtctcc acaaacaaca 20 48 20 DNA Artificial Sequence Antisense Oligonucleotide 48 gtatccagag gctctgtctc 20 49 20 DNA Artificial Sequence Antisense Oligonucleotide 49 atagtggtat ccagaggctc 20 50 20 DNA Artificial Sequence Antisense Oligonucleotide 50 tgcattatag tggtatccag 20 51 20 DNA Artificial Sequence Antisense Oligonucleotide 51 agttaacaag cattcagcca 20 52 20 DNA Artificial Sequence Antisense Oligonucleotide 52 ggtcacttgt cgcaagtcac 20 53 20 DNA Artificial Sequence Antisense Oligonucleotide 53 tgtcgaggtc acttgtcgca 20 54 20 DNA Artificial Sequence Antisense Oligonucleotide 54 tcagttttct ccctgcatga 20 55 20 DNA Artificial Sequence Antisense Oligonucleotide 55 tgaggcatcc tctgtttgtt 20 56 20 DNA Artificial Sequence Antisense Oligonucleotide 56 gttatttcct gaggcatcct 20 57 20 DNA Artificial Sequence Antisense Oligonucleotide 57 cttctgcact gaattcttct 20 58 20 DNA Artificial Sequence Antisense Oligonucleotide 58 attggttgcc atttccgtca 20 59 20 DNA Artificial Sequence Antisense Oligonucleotide 59 agtctgaaat cctggtagct 20 60 20 DNA Artificial Sequence Antisense Oligonucleotide 60 gctgaacgaa ggaacatagc 20 61 20 DNA Artificial Sequence Antisense Oligonucleotide 61 gaaaatctca gctgaacgaa 20 62 20 DNA Artificial Sequence Antisense Oligonucleotide 62 gcccagacgg aagtttctta 20 63 20 DNA Artificial Sequence Antisense Oligonucleotide 63 aattctttct tccaataggt 20 64 20 DNA Artificial Sequence Antisense Oligonucleotide 64 atcagagata ccactatttc 20 65 20 DNA Artificial Sequence Antisense Oligonucleotide 65 cataggtgtt atatattcat 20 66 20 DNA Artificial Sequence Antisense Oligonucleotide 66 ctgtaagcag agcatactcc 20 67 20 DNA Artificial Sequence Antisense Oligonucleotide 67 gtattgtcta tctggagaca 20 68 20 DNA Artificial Sequence Antisense Oligonucleotide 68 aagtggctcc tgaagcttct 20 69 20 DNA Artificial Sequence Antisense Oligonucleotide 69 tcaggctggt gaatcttaca 20 70 20 DNA Artificial Sequence Antisense Oligonucleotide 70 aagtgttgag gattttcagg 20 71 20 DNA Artificial Sequence Antisense Oligonucleotide 71 acaggcaaag tgttgaggat 20 72 20 DNA Artificial Sequence Antisense Oligonucleotide 72 taattcagtc aggcgaccca 20 73 20 DNA Artificial Sequence Antisense Oligonucleotide 73 gtgatgattg aatgtccgta 20 74 20 DNA Artificial Sequence Antisense Oligonucleotide 74 gacatcagca tctcagcgtg 20 75 20 DNA Artificial Sequence Antisense Oligonucleotide 75 aatccccatc actgcacgtc 20 76 20 DNA Artificial Sequence Antisense Oligonucleotide 76 agaaaaagga gctagacccc 20 77 20 DNA Artificial Sequence Antisense Oligonucleotide 77 tatacatcag attaatatga 20 78 20 DNA Artificial Sequence Antisense Oligonucleotide 78 ggaaagttat acatcagatt 20 79 20 DNA Artificial Sequence Antisense Oligonucleotide 79 ctgggtacaa gtgaaataaa 20 80 20 DNA Artificial Sequence Antisense Oligonucleotide 80 gagtgaaact gggtacaagt 20 81 20 DNA Artificial Sequence Antisense Oligonucleotide 81 aaatattcat caagatttct 20 82 20 DNA Artificial Sequence Antisense Oligonucleotide 82 gaagttacac atgtaattac 20 83 20 DNA Artificial Sequence Antisense Oligonucleotide 83 agccttttaa aacacaatgt 20 84 20 DNA Artificial Sequence Antisense Oligonucleotide 84 cctagttcaa cagatagaat 20 85 20 DNA Artificial Sequence Antisense Oligonucleotide 85 ttttccctag ttcaacagat 20 86 20 DNA Artificial Sequence Antisense Oligonucleotide 86 caaaatgaga ttttccctag 20 87 20 DNA Artificial Sequence Antisense Oligonucleotide 87 agattgaata caactcttta 20 88 20 DNA Artificial Sequence Antisense Oligonucleotide 88 tgtttgcttt attgccaaga 20 

What is claimed is:
 1. A compound 8 to 50 nucleobases in length targeted to a nucleic acid molecule encoding human FXR, wherein said compound specifically hybridizes with said nucleic acid molecule encoding human FXR and inhibits the expression of human FXR.
 2. The compound of claim 1 which is an antisense oligonucleotide.
 3. The compound of claim 2 wherein the antisense oligonucleotide has a sequence comprising SEQ ID NO: 11, 13, 14, 15, 16, 17, 18, 19, 20, 22, 25, 26, 27, 28, 30, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 44, 45, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 64, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 78, 80, 82, 83, 84, 85, 87 or
 88. 4. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified internucleoside linkage.
 5. The compound of claim 4 wherein the modified internucleoside linkage is a phosphorothioate linkage.
 6. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified sugar moiety.
 7. The compound of claim 6 wherein the modified sugar moiety is a 2′-O-methoxyethyl sugar moiety.
 8. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified nucleobase.
 9. The compound of claim 8 wherein the modified nucleobase is a 5-methylcytosine.
 10. The compound of claim 2 wherein the antisense oligonucleotide is a chimeric oligonucleotide.
 11. A compound 8 to 50 nucleobases in length which specifically hybridizes with at least an 8-nucleobase portion of an active site on a nucleic acid molecule encoding human FXR.
 12. A composition comprising the compound of claim 1 and a pharmaceutically acceptable carrier or diluent.
 13. The composition of claim 12 further comprising a colloidal dispersion system.
 14. The composition of claim 12 wherein the compound is an antisense oligonucleotide.
 15. A method of inhibiting the expression of human FXR in cells or tissues comprising contacting said cells or tissues with the compound of claim 1 so that expression of human FXR is inhibited.
 16. A method of treating a human having a disease or condition associated with FXR comprising administering to said human a therapeutically or prophylactically effective amount of the compound of claim 1 so that expression of FXR is inhibited.
 17. The method of claim 16 wherein the disease or condition is a cardiovascular disease.
 18. The method of claim 16 wherein the disease or condition is atherosclerosis.
 19. The method of claim 16 wherein the disease or condition is characterized by hypercholesterolemia.
 20. The method of claim 16 wherein the disease or condition is characterized by increased levels of serum cholesterol. 